Sunday, 24 July 2022

HB Donor Fragment Selection Themes

In this post I’ll look at a couple of fragment selection themes with a hydrogen bond donor (HBD) focus. The material has been taken from the recent ‘HBDs in drug design’ preprint (HBD3) which introduced the term ‘hydrogen bond donor-acceptor asymmetry’ and suggested that we need to think differently about HBDs and hydrogen bond acceptors (HBAs) in drug design. One example of these hydrogen bond donor-acceptor asymmetries is that HBAs are typically more strongly solvated than HBDs in aqueous media and this is especially relevant to lead optimization (as shown in the graphical abstract for HBD3 below). 

However, this post is about fragment selection, rather than fixing ADME, and so I’ll say something about differences between HBDs and HBAs in the context of binding to targets. Let’s suppose that you’d like to exploit an HBD in the binding site of your target. All you need to do is place an HBA at a point in space where it can form a good hydrogen bond (taking care to address issues like steric footprint and conformational energy) and you’ve got it sorted. However, life is not quite so simple if you’re trying to exploit an HBA in the binding site because the HBD (e.g., amide NH) that you present to it will almost invariably be accompanied by an HBA (e.g., amide carbonyl O). In contrast, it is relatively easy to design an HBA (e.g., pyridine N) into a ligand structure that is not accompanied by an HBD.   

In HBD3, I describe the HBA that accompanies pretty much every neutral HBD as ‘co-occurring’. The problem for designers is that the co-occurring HBA, which is likely to come with a larger desolvation penalty than that for the HBD, needs to be accommodated and this places constraints on design. It’s also more difficult to achieve ‘line-of-sight’ access with HBDs than is the case for HBAs (you’re likely to need line-of-sight access when targeting a polar atom at the bottom of a relatively narrow binding pocket). The following figure should give you a better idea of what I’m getting at. Let’s assume that we’re trying to donate an HB to an HBA sitting at the bottom of a narrow and otherwise non-polar binding pocket. Although each of the three structures has appropriate geometry for line-of-sight access, things are not likely to end well if you try to exploit this line-of-sight access in a real-life design situation.

Let’s start with the phenol and, although not pertinent to this discussion, it’s worth mentioning that hydroxyl groups are prone to conjugation in phase 2 metabolism (drugs get hydroxylated in phase 1 metabolism in order to facilitate clearance). Donation of an HB by a ligand hydroxyl to a target HBA also brings the hydroxyl oxygen (the co-occurring HBA) into proximity with the molecular surface of the target. This increases the likelihood of an energetic penalty resulting from desolvation of the phenolic oxygen. One subtle point is that donation of an HB by the phenolic hydroxyl increases the HB basicity of the oxygen which effectively increases the energetic cost of desolvating it.

The co-occurring HBA of the primary amide is an even bigger problem than for phenol because the high polarity of the carbonyl oxygen means that it carries a large desolvation penalty (bad news if you’re trying to hit an HBA at the bottom of a narrow and otherwise non-polar binding pocket). If this is not enough of a problem, you also need to worry about desolvation penalties associated with the second HBD (the primary amide has two HBDs and methyl-capping will take out the one that you need for hitting that HBA at the bottom of the binding pocket). As Lady Bracknell might have observed, “One desolvated polar atom may be regarded as a misfortune; to lose solvation of two polar atoms looks like carelessness”.

The last of the trio of structures is pyrazole linked at C4 and this avoids problems that might result from biasing the tautomeric preference. Pyrazole is a great warhead if you’re targeting a proximal HBD and HBA (as is the case when trying to hit a kinase hinge). However, pyrazole’s HBA may become a liability when trying to hit the HBA at the bottom of that otherwise non-polar binding pocket. Why not just take out pyrazole’s HBA, you might ask? The problem is that pyrroles are very electron rich and tend to be quite reactive.  One tactic is to move the co-occurring HBA from the ring to the linker (1 and 2) in a way that makes the linker electron-withdrawing and pray for a less destabilizing contact between the co-occurring HBA and the binding site. Alternatively, you can take out the co-occurring HBA and modify the linker to make it more electron-withdrawing (3). I’ve included Hammett σ values in the graphic and these will give you an idea how the substituents vary in their ability to suck electron density out of the pyrrole ring (beneficial both for making the pyrrole ring more rugged and increasing the HB acidity of its NH HBD).  I see these fragments as being of about the right size to be screened crystallographically but you might want something a bit larger than methyl if you’re using another detection method.

If you’re designing (or trying to improve the coverage of) a fragment library then another selection theme that you might want to think about is fragments that can present a high ‘density’ of HBDs to a target while minimizing the number of co-occurring HBAs. One way to do this is to use the guanidine substructure although this will cause some medicinal chemists to roll their eyes (concerns about permeability) while Ro3’s adherents would be likely to denounce you for heresy (actually not such a bad thing and I think that the late, great Denis Healey might have likened this to “being savaged by a dead sheep”). Guanidine itself is extremely basic (pKa = 13.6 | ref) which means very little of the neutral form for diffusing across membranes. However, the pKa of guanidine is also extremely sensitive to substitution and a number of approved drugs incorporate this substructure. I should also point out that, even in the neutral form of guanidine, the amide-like nitrogen atoms do not function as HBAs (even though they’d be counted as such when applying Ro5).
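To put a rough number on ‘very little of the neutral form’, here’s a minimal Python sketch based on the Henderson-Hasselbalch relationship. It assumes a single ionizable (monoprotic) basic center and a pH of 7.4, and the function name is my own invention for illustration rather than anything taken from HBD3.

```python
def neutral_fraction(pka: float, ph: float) -> float:
    """Fraction of a monoprotic base present in its neutral form.

    Henderson-Hasselbalch for a base B with conjugate acid BH+:
    [BH+]/[B] = 10**(pKa - pH), so [B]/([B] + [BH+]) = 1/(1 + 10**(pKa - pH)).
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


# Guanidine itself, using the pKa quoted above and an assumed pH of 7.4.
print(f"guanidine (pKa 13.6): {neutral_fraction(13.6, 7.4):.1e}")

# A base with pKa equal to the pH is half neutral, by definition.
print(f"pKa equal to pH: {neutral_fraction(7.4, 7.4):.2f}")
```

For unsubstituted guanidine the neutral fraction comes out at well under one part per million, which is why the sensitivity of guanidine pKa to substitution matters so much.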

I’ve made a small selection of substituted guanidines that I think may be of interest for screening as fragments. The pKa values that I quote in this post are from an article by two former colleagues (Peter Taylor and Alan Wait who are sadly both deceased) and this is an excellent source of measured guanidine pKa values. Two of these (4 and 5) will be predominantly protonated at neutral pH although there’ll still be a significant amount of the neutral form that you’ll need for permeability. The other two guanidines will be predominantly neutral at neutral pH although 6 is sufficiently basic to protonate in lysosomes. As for the pyrroles, I see these as about the right size to be screened crystallographically but you might want something a bit bigger than methyl if you plan to use a different detection method.


  


Friday, 1 April 2022

Enthalpic fragments

Enthalpy-driven binding has been presented as a rationale for screening fragments although some have argued that thermodynamic signature is actually a 'red herring' in the context of drug discovery. Binding of a ligand grown from a fragment hit incurs a translational entropy penalty that is similar to that of the original fragment hit and it is therefore hardly surprising that synthetic elaboration results in binding that is more driven by entropy.

A recent collaborative study between researchers in the Budapest Enthalpomics Group (BEG) and Prof Wilhelmina Wiplasch, well known for her seminal study ‘The Ecstasy and Agony of Recreational PAINS’, shows this view to be hopelessly naïve. The mathematical treatment used in the study is formidable and was originally developed by Prof Wiplasch during a sabbatical at the Port-au-Prince Institute of Biogerontology. Briefly, deep learning was used to model the time-dependent covariance and kurtosis of the polarizability tensor for a series of rhodanines, showing that the enthalpic nature of fragment binding is caused by their greater ligand efficiencies. “This model comprehensively outperforms all competitors”, explains Group Leader Prof Kígyó Olaj, “and we have shown for the very first time that the Sackur-Tetrode equation can be safely consigned to the dustbin of History”.

Tuesday, 12 January 2021

Tom Lehrer's guide to design of SARS-CoV-3 main protease inhibitors for treatment of COVID-32


It’s been ages since my last COVID-19 post (How not to repurpose a 'drug') and I’ll kick blogging off for 2021 with a follow up to an even older post (SARS-CoV-2 main protease. Crowdsourcing, peptidomimetics and fragments). I consider it unlikely that a SARS-CoV-2 main protease inhibitor, designed from scratch, will be available in time to have real impact on the current pandemic (in saying this, I’m making the huge assumption that defeat does not get snatched from the jaws of victory on the vaccination front). While many grinning Lean Six Sigma ‘belts’ (and their synchronously smiling allies in Human Resources) would denounce this as negative and defeatist, what I’m really getting at is that we need to think about targeting SARS-CoV-3 main protease when designing inhibitors for SARS-CoV-2 main protease. As Tom Lehrer advises in the intro to So Long, Mom, “If any songs are going to come from World War III, we better start writing them now”.

Happy New Year (this orchid opened during night of Dec 31/Jan 1)

If we’re designing a SARS-CoV-2 main protease inhibitor to also hit SARS-CoV-3 main protease then it’d be a good idea to engineer it to have greater affinity than necessary for the current target. In the fourth of his rules for air fighting, ‘Sailor’ Malan (readers may also be interested in his insights into fragment screening library design) asserts that “height gives you the initiative” which can be adapted for drug design as “affinity gives you the initiative”.  We should anticipate that inhibitors optimized against the current target will have lower affinity for the future target(s) although it’s obviously not a problem if this proves not to be the case. In any case, high affinity allows you to use a lower dose and that’s an important consideration if you’re planning for healthy people such as nurses and doctors to take the drug prophylactically in order to remain healthy. For a SARS-CoV-2 main protease inhibitor, I’d be looking at a target affinity of 1 nM (or better) which I believe would be achievable without causing too many self-appointed arbiters of 'compound quality' to spit feathers. Pfizer began a phase I study of the SARS-CoV-2 main protease inhibitor PF-00835321 (Ki = 0.27 nM; dosed intravenously as the phosphate pro-drug PF-07304814) in September 2020 although this compound had actually come from a discontinued SARS-CoV project. 

If we want to maximize the chances that a SARS-CoV-2 main protease inhibitor will exhibit comparable affinity for SARS-CoV-3 (or even SARS-CoV-4) then we need to exploit protein structural features that are likely to be conserved between the different main proteases. This points to milking as much activity as possible out of the core substructure of the inhibitor as a design strategy. With this in mind, I suggest that we really do need to exploit the catalytic cysteine if we’re serious about treating COVID-32 (or worried about SARS-CoV-2 main protease mutations).

In drug design, we typically exploit a catalytic cysteine by forming a covalent bond between the thiol sulfur and an electrophilic atom in the molecular structure of the inhibitor (PF-00835321 uses the carbon of a carbonyl group to engage the catalytic cysteine). The functional group containing the electrophilic atom is commonly referred to as a “warhead” and covalent bond formation with the cysteine can be either reversible or irreversible. Geometric constraints associated with covalent bond formation are typically a lot more stringent than for hydrogen bonds and you’ll make life much easier for yourself by getting the warhead into structures as early as possible in hit-to-lead. I generally recommend using reversible warheads in design of cysteine protease inhibitors (PF-00835321 binds reversibly to SARS-CoV-2 main protease) and present my reasoning in this document. In essence, irreversible inhibition adds complexity to design (both Ki and kinact need to be controlled) while placing greater technical demands on the design team (e.g. for generation of the structural models for transition states required for structure-based design).
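To illustrate why both Ki and kinact need to be controlled, here’s a minimal Python sketch of the textbook two-step scheme for irreversible inhibition (rapid reversible binding followed by covalent bond formation). The scheme itself, the assumption that inhibitor is in large excess over enzyme with no competing substrate, and all of the numerical values are my assumptions for illustration and are not taken from any of the studies mentioned in this post.

```python
import math


def k_obs(conc: float, ki: float, kinact: float) -> float:
    """Pseudo-first-order inactivation rate constant (1/s).

    Assumed scheme: E + I <=> E.I (dissociation constant Ki) -> E-I (rate kinact).
    conc and ki must be in the same concentration unit.
    """
    return kinact * conc / (ki + conc)


def fraction_inactivated(conc: float, ki: float, kinact: float, time_s: float) -> float:
    """Fraction of enzyme covalently modified after time_s seconds."""
    return 1.0 - math.exp(-k_obs(conc, ki, kinact) * time_s)


# Two hypothetical inhibitors with the same kinact/Ki (1000 per molar per second).
inhibitor_a = {"ki": 1e-6, "kinact": 1e-3}
inhibitor_b = {"ki": 1e-5, "kinact": 1e-2}

for conc in (1e-8, 1e-4):  # molar
    a = k_obs(conc, **inhibitor_a)
    b = k_obs(conc, **inhibitor_b)
    print(f"[I] = {conc:.0e} M: k_obs(A) = {a:.2e} /s, k_obs(B) = {b:.2e} /s")
```

Under these assumptions the two hypothetical inhibitors are almost indistinguishable at low concentration, where only the ratio kinact/Ki matters, but behave very differently once the concentration approaches Ki. That is one way of seeing why a single potency number is not enough for design.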

The argument typically presented in support of irreversible inhibition (and slow binding kinetics) is that it leads to longer duration of action. This argument emphasizes benefits of slow (or zero) off-rate during the elimination phase while ignoring disadvantages of slow on-rate during the distribution phase and I’ll point you to an insightful article by my former colleague Rutger Folmer. While there will be situations in which irreversible inhibition really is the best option, the decision as to whether to go for reversible or irreversible inhibitors is one that should be carefully considered at the start of the project. In drug discovery, it usually ends in tears once the tail starts wagging the dog as would be the case if choice of screening tactics (covalent fragment screening typically finds irreversible binders) were to dictate lead optimization strategy. In particular, I wouldn't really recommend the laissez faire approach to project management (“once the rockets are up, who cares where they come down”) chronicled by Tom Lehrer.

Here's some information that may be of interest if you're selecting or designing warheads to form covalent bonds with catalytic cysteines. First, a couple of comparative studies of reversible and irreversible warheads. Second, some papain inhibition data taken from the literature ( B1977 | W1972 | L1971 ), summarized in the graphic below, that are relevant to fragment library design.

Off-target activity is always a concern in drug design since this can cause toxicity (it’s often considered politer to say “adverse drug reaction” rather than use the uncouth T-word although Tom Lehrer provides a useful perspective) and that’s a strong rationale for trying to achieve a low therapeutic dose. It’s my understanding (still wading through literature) that SARS-CoV-2 main protease functions in the endoplasmic reticulum which means that the relevant physiological pH is close to neutral. Many proteases (potential anti-targets for SARS-CoV-2 main protease inhibitors) function in acidic compartments such as lysosomes and the presence of a basic center in the molecular structure of an inhibitor will tend to draw it into these acidic compartments. When designing SARS-CoV-2 inhibitors, the safest option is simply to avoid basic centers (see F2005). In particular, to link a ‘gratuitous’ basic center and an irreversible warhead would be to tempt launchpad misadventure.
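Here’s a minimal Python sketch of how strongly a basic center might draw a compound into an acidic compartment. It uses the simple pH-partition picture, in which only the neutral form crosses the membrane and reaches the same concentration on both sides. The compartment pH values (5.0 and 7.2) and the pKa values are assumptions chosen for illustration, and the model ignores membrane potential, binding and active transport.

```python
def trapping_ratio(pka: float, ph_acidic: float = 5.0, ph_neutral: float = 7.2) -> float:
    """Ratio of total (neutral + protonated) concentration of a monoprotic base
    in an acidic compartment to that in a near-neutral compartment at equilibrium.

    Assumes equal neutral-form concentration on both sides of the membrane, so
    total concentration on each side is proportional to 1 + 10**(pKa - pH).
    """
    return (1.0 + 10.0 ** (pka - ph_acidic)) / (1.0 + 10.0 ** (pka - ph_neutral))


# Hypothetical pKa values for a very weak base, a moderate base and a strong base.
for pka in (3.0, 7.0, 9.0):
    print(f"pKa {pka:.1f}: accumulation ratio ~ {trapping_ratio(pka):.1f}")
```

In this picture the accumulation ratio stays close to 1 for very weak bases and approaches a ceiling set by the pH difference between the two compartments (10 raised to the power of 2.2 for the assumed pH values) for strong bases.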

I'll conclude the post with an observation that the COVID-19 pandemic seems to have triggered a parallel pandemic in scholarly publishing which is forcing scientists to be more creative in finding new ways to get their messages to stand out. I'll let Tom Lehrer have the last word.

Sunday, 9 August 2020

How not to repurpose a 'drug'



I sometimes wonder what percentage of the pharmacopoeia will have been proposed for repurposing for the treatment of COVID-19 by the end of 2020. In particular, I worry about the long-term, psychological effects on bloggers such as Derek, who is forced to play whack-a-mole with hydroxychloroquine repurposing studies. Those attempting to use text mining and machine learning to prioritize drugs for repurposing should take note of the views expressed in this tweet.

The idea behind drug repurposing is very simple. If an existing drug looks like it might show therapeutic benefits in the disease that you’re trying to treat then you can go directly to assessing efficacy in humans without having to do any of those irksome Phase I studies. However, you need to be aware that the approval of a drug always places restrictions on the dose that you can use and route of administration (for example, you can't administer a drug intravenously if it has only been approved for oral administration). One rationale for drug repurposing is that the target(s) for the drug may also have a role in the disease that you’re trying to treat. Even if the target is not directly relevant to the disease, the drug may engage a related target that is relevant with sufficient potency to have a therapeutically exploitable effect. While these rationales are clear, I do get the impression that some who use text-mining and machine learning to prioritize drugs for repurposing may simply be expecting the selected drugs to overwhelm targets with druglikeness.

There are three general approaches to directly tackle a virus such as SARS-CoV-2 with a small molecule drug (or chemical agent). First, destroy the virus before it even sees a host cell and this is the objective of hand-washing and disinfection of surfaces. Second, prevent the virus from infecting host cells, for example, by blocking the interaction between the spike protein and ACE2. Third, prevent the virus from functioning in infected cells, for example, by inhibiting the SARS-CoV-2 main protease. One can also try to mitigate the effects of viral infection, for example, by using anti-inflammatory drugs to counter cytokine storm although I’d not regard this as tackling the virus directly.

In this post, I’ll be reviewing an article which suggests that quaternary ammonium compounds could be repurposed for treatment of COVID-19. The study received NIH funding and this may be of interest to researchers who failed to secure NIH funding. The article was received on 06-May-2020, accepted on 18-May-2020 and published on 25-May-2020. One of the authors of the article is a member of the editorial advisory board of the journal. As of 08-Aug-2020, two of the authors are described as cheminformatics experts in their Wikipedia biographies and one is also described as an expert in computational toxicology. 

The authors state: “This analysis identified ammonium chloride, which is commonly used as a treatment option for severe cases of metabolic alkalosis, as a drug of interest. Ammonium chloride is a quaternary ammonium compound that is known to also have antiviral activity (13,14) against coronavirus (Supplementary Material) and has a mechanism of action such as raising the endocytic and lysosomal pH, which it shares with chloroquine (15). Review of the text-mined literature also indicated a high-frequency of quaternary ammonium disinfectants as treatments for many viruses (Supplementary Material) (16,17), including coronaviruses: these act by deactivating the protective lipid coating that enveloped viruses like SARS-CoV-2 rely on.”

Had I described ammonium chloride as a “quaternary ammonium compound” at high school in Trinidad (I was taught by the Holy Ghost Fathers), I’d have received a correctional package of licks and penance. For cheminformatics ‘experts’ to make such an error should remind us that each and every expert has an applicability domain and a shelf life. However, the errors are not confined to nomenclature since the cationic nitrogen atoms of a quaternary ammonium compound and a protonated amine are very different beasts. While a protonated amine can deprotonate in order to cross a lipid bilayer, the positive charge of a quaternary ammonium compound can be described as ‘permanent’ and this has profound consequences for its physicochemical behavior. First, the protonation state of a quaternary ammonium nitrogen does not change in response to a change in pH. This means that, unlike amines, quaternary ammonium compounds are not drawn into lysosomes and other acidic compartments. Second, the positive charge needs to be balanced by an anion (in some cases, this may be in the same covalent framework as the quaternary ammonium nitrogen). Despite being positively charged, the quaternary ammonium group is not as polar as you might think because it can’t donate hydrogen bonds to water. However, to get out of water it needs to take its counterion (which is typically polar) with it. I like to think about quaternary ammonium compounds (and other permanent cations) as hydrophobic blobs that are held in solution by the solvation of their counterions. A typical quaternary ammonium compound can also be considered as a detergent in which the polar and non-polar parts are not covalently bonded to each other. 

My view is that the antiviral ‘activity’ reported for ammonium chloride and chloroquine is a red herring when considering potential antiviral activity of quaternary ammonium compounds because neither has a quaternary ammonium center in its molecular structure. Nevertheless, I consider “raising the endocytic and lysosomal pH” to be an unconvincing ‘explanation’ for the antiviral ‘activity’ of ammonium chloride and chloroquine since one would anticipate analogous effects for any base of comparable pKa. One should also anticipate considerable collateral damage to result from raising the endocytic and lysosomal pH (assuming that the ‘drug’ is able to overwhelm the buffering systems that have evolved to maintain physiological pH in live humans). The pH raising ‘explanation’ for antiviral ‘activity’ reminded me of suggestions that cancer can be cured by drinking aqueous sodium bicarbonate and I’ll direct readers to this relevant post by Derek.

This brings us to cetylpyridinium chloride and miramistin shown below and I’ve included the structure of paraquat in the graphic. While miramistin does indeed have a quaternary ammonium nitrogen in its molecular structure, cetylpyridinium chloride is not a quaternary ammonium compound (the cationic nitrogen is only connected to three atoms) and would be more correctly referred to as an N-alkylpyridinium compound (or salt). Nevertheless, this is a less serious error than describing ammonium chloride as a quaternary ammonium compound because cetylpyridinium is, at least, a permanent cation. Neither cetylpyridinium chloride nor miramistin is quite as clean as the authors might have you believe (take a look at L1991 | L1996 | D2017 | K2020 | P2020). I’d expect an N-alkylpyridinium cation to be more electrophilic than a tetraalkylammonium cation and paraquat, with two N-alkylpyridinium substructures, is highly toxic. Would Lady Bracknell's toxicity assessment have been that one N-alkylpyridinium may be regarded as a misfortune while two looks like carelessness?

I have no problem with hypothesizing that a chemical agent, such as cetylpyridinium chloride, which destroys SARS-CoV-2 on surfaces could do the same thing safely when sprayed up your nose, into your mouth or down your throat. If tackling the virus in this manner, you do need to be thinking about the effects of the chemical agent on the mucus (which is believed to protect against viral infection). The authors assert that cetylpyridinium chloride “has been used in multiple clinical trials” although they only cite this study in which it was used in conjunction with glycerin and xanthan gum (claimed by the authors of the clinical study to “form a barrier on the host mucosa, thus preventing viral contact and invasion”).

The main challenge to a proposal that cetylpyridinium chloride be repurposed for treatment of COVID-19 is that the compound does not appear to have actually been conventionally approved (i.e. shown to be efficacious and safe) as a drug for dosing as a nasal spray, mouth wash or gargle. Another difficulty is that cetylpyridinium chloride does not appear to have a specific molecular target. Something that should worry readers of the article is that the authors make no reference to literature in which potential toxicity of cetylpyridinium chloride and quaternary ammonium compounds is discussed.

This is a good place to wrap up and, here in Trinidad's Maraval Valley, I'm working on a cure for COVID-19. I anticipate a phone call from Stockholm later in the year.


Sunday, 2 August 2020

Why fragments?


Paramin panorama

Crystallographic fragment screens have been run recently against the main protease (at Diamond) and the Nsp3 macrodomain (at UCSF and Diamond) of SARS-CoV-2 and I thought that it might be of interest to take a closer look at why we screen fragments. Fragment-based lead discovery (FBLD) actually has origins in both crystallography [V1992 | A1996] and computational chemistry [M1991 | B1992 | E1994]. Measurement of affinity is important in fragment-to-lead work because it allows fragment-based structure-activity relationships to be established prior to structural elaboration. Affinity measurement is typically challenging when fragment binding has been detected using crystallography although affinity can be estimated by observation of the response of occupancy to concentration (the ∆G° value of −3.1 kcal/mol reported for binding of pyrazole to protein kinase B was derived in this manner).
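To give a feel for what a ∆G° of −3.1 kcal/mol means in terms of occupancy, here’s a minimal Python sketch. It assumes single-site binding with ligand in large excess over protein, a 1 M standard concentration and a temperature of 298.15 K. The temperature and the concentrations in the loop are my assumptions and not necessarily the conditions under which the pyrazole measurement was made.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def kd_from_dg(dg_kcal: float, temp_k: float = 298.15, c_std: float = 1.0) -> float:
    """Dissociation constant (same unit as c_std) from the standard free energy
    of binding, using dG = RT ln(Kd / C_std)."""
    return c_std * math.exp(dg_kcal / (R_KCAL * temp_k))


def occupancy(conc: float, kd: float) -> float:
    """Fractional site occupancy for simple 1:1 binding with ligand in excess."""
    return conc / (conc + kd)


kd = kd_from_dg(-3.1)
print(f"Kd ~ {kd * 1e3:.1f} mM")

# Occupancy at some hypothetical soaking concentrations (molar).
for conc in (1e-3, 1e-2, 1e-1):
    print(f"[L] = {conc * 1e3:5.0f} mM: occupancy ~ {occupancy(conc, kd):.2f}")
```

With these assumptions the dissociation constant is in the millimolar range, which is why the high ligand concentrations that crystallography can tolerate are so useful for detecting fragment binding.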

Although fragment-based approaches to lead discovery are widely used, it is less clear why fragment-based lead discovery works as well as it appears to. While it has been stated that “fragment hits form high-quality interactions with the target”, the concept of interaction quality is not sufficiently well-defined to be useful in design. I ran a poll which asked about the strongest rationale for screening fragments.  The 65 votes were distributed as follows: ‘high ligand efficiency’ (23.1%), ‘enthalpy-driven binding’ (16.9%), ‘low molecular complexity’ (26.2%) and ‘God loves fragments’ (33.8%). I did not vote.

The belief that fragments are especially ligand-efficient has many adherents in the drug discovery field and it has been asserted that “fragment hits typically possess high ‘ligand efficiency’ (binding affinity per heavy atom) and so are highly suitable for optimization into clinical candidates with good drug-like properties”. The fundamental problem with ligand efficiency (LE), as conventionally calculated, is that perception of efficiency varies with the arbitrary concentration unit in which affinity is expressed (have you ever wondered why Kd, Ki or IC50 has to be expressed in mole/litre for calculation of LE?). This would appear to be a rather undesirable characteristic for a design metric and LE evangelists might consider trying to explain why it’s not a problem rather than dismissing it as a “limitation” of the metric or trying to shift the burden of proof onto the skeptics to show that the evangelists’ choice of concentration unit for calculation of LE is not useful.

The problems associated with the arbitrary nature of the concentration unit used to express affinity were first identified in 2009 and further discussed in 2014 and 2019. Specifically, it was noted that LE has a nontrivial dependency on the concentration,  C°, used to define the standard state. If you want to do solution thermodynamics with concentrations defined then you do need to specify a standard concentration. However, it is important to remember that the choice of standard concentration is necessarily arbitrary if the thermodynamic analysis is to be valid. If your conclusions change when you use a different definition of the standard state then you’ll no longer be doing thermodynamics and, as Pauli might have observed, you’ll not even be wrong. You probably don't know it, but when you use the LE metric, you’re making the sweeping assumption that all values of Kd, Ki and IC50 tend to a value of 1 M in the limit of zero molecular size. Recalling the conventional criticism of homeopathy, is there really a difference between a solute that is infinitely small and a solute that is infinitely dilute?
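The dependence of LE on C° is easy to demonstrate numerically and here’s a minimal Python sketch. It takes LE as −∆G° divided by the number of non-hydrogen atoms with ∆G° = RT ln(Kd/C°), assumes a temperature of 298.15 K, and uses two entirely hypothetical compounds (the Kd values and heavy atom counts are made up for illustration).

```python
import math

RT = 1.987204e-3 * 298.15  # kcal/mol at an assumed temperature of 298.15 K


def ligand_efficiency(kd_molar: float, n_heavy: int, c_std_molar: float = 1.0) -> float:
    """LE in kcal/mol per heavy atom for a given standard concentration.

    dG = RT ln(Kd / C_std); LE = -dG / n_heavy.
    """
    dg = RT * math.log(kd_molar / c_std_molar)
    return -dg / n_heavy


# Hypothetical compounds: (Kd in mol/L, number of non-hydrogen atoms).
fragment = (1e-3, 10)   # 1 mM fragment hit
lead = (1e-8, 30)       # 10 nM elaborated compound

for c_std in (1.0, 1e-3):  # 1 M and 1 mM standard concentrations
    le_fragment = ligand_efficiency(*fragment, c_std_molar=c_std)
    le_lead = ligand_efficiency(*lead, c_std_molar=c_std)
    print(f"C_std = {c_std:g} M: LE(fragment) = {le_fragment:.2f}, LE(lead) = {le_lead:.2f}")
```

With the conventional 1 M standard concentration the hypothetical fragment looks more efficient than the elaborated compound, but the ranking reverses when the standard concentration is changed to 1 mM. Nothing about the compounds or their binding has changed, only the unit in which affinity is expressed.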

I think that’s enough flogging of inanimate equines for one blog post so let’s take a look at enthalpy-driven binding. My view of thermodynamic signature characterization in drug discovery is that it’s, in essence, a solution that’s desperately seeking a problem. In particular, there does not appear to be any physical basis for claims that the thermodynamic signature is a measure of interaction quality. In case you’re thinking that I’m an unrepentant Luddite, I will concede that thermodynamic signatures could prove useful for validating physics-based models of molecular recognition and, in specific cases, they may point to differences in binding mode within congeneric series. I should also stress that the modern isothermal calorimeter is an engineering marvel and I'd always want this option for label-free, affinity measurement in any project.

It is common to see statements in the thermodynamic signature literature to the effect that binding is ‘enthalpy-driven’ or ‘entropy-driven’ although it was noted in 2009 (coincidentally, in the same article that highlighted the nontrivial dependence of LE on C°) that these terms are not particularly meaningful. The problems start when you make comparisons between the numerical values of ∆H (which is independent of C°) and T∆S° (which depends on C°). If I’d presented such a comparison in physics class at high school (I was taught by the Holy Ghost Fathers in Port of Spain), I would have been caned with a ferocity reserved for those who’d dozed off in catechism class.  I’ll point you toward an article which asserts that, “when compared with many traditional druglike compounds, fragments bind more enthalpically to their protein targets”. I have a number of issues with this article although this is not the place for a comprehensive review (although I’ll probably pick it up in ‘The Nature of Lipophilic Efficiency’ when that gets written).
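The dependence of T∆S° on C° can be made explicit with a short derivation. This is a sketch that takes ∆H to be independent of C° (as stated above) and writes K_d for the dissociation constant; the primed quantities refer to a second, equally arbitrary, choice of standard concentration.

```latex
\begin{align}
\Delta G^\circ &= RT \ln\!\left(\frac{K_d}{C^\circ}\right) \\
T\Delta S^\circ &= \Delta H - \Delta G^\circ \\
\Delta G^{\circ\prime} &= RT \ln\!\left(\frac{K_d}{C^{\circ\prime}}\right)
  = \Delta G^\circ + RT \ln\!\left(\frac{C^\circ}{C^{\circ\prime}}\right) \\
T\Delta S^{\circ\prime} &= \Delta H - \Delta G^{\circ\prime}
  = T\Delta S^\circ - RT \ln\!\left(\frac{C^\circ}{C^{\circ\prime}}\right)
\end{align}
```

Changing the standard concentration from 1 M to 1 mM at 298 K shifts T∆S° by RT ln(1000), which is about 4.1 kcal/mol, while leaving ∆H untouched. Binding that appears to be ‘enthalpy-driven’ with one choice of C° can therefore appear to be ‘entropy-driven’ with another.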

While I don’t believe that the authors have actually demonstrated that fragments bind more enthalpically than ligands of greater molecular size, I wouldn’t be surprised to discover that gains in affinity over the course of a fragment-to-lead (F2L) campaign had come more from entropy than enthalpy. First, the lost translational entropy (the component of ∆S° that endows it with its dependence on C°) is shared over a greater number of intermolecular contacts for structurally-elaborated compounds and this article is relevant to the discussion. Second, I’d expect the entropy of any water molecule to increase when it is moved to bulk solvent from contact with molecular surface of ligand or target (regardless of polarity of the molecular surface at the point of contact). Nevertheless, this is something that you can test easily by examining the response of (∆H + T∆S°) to ∆G° (best not to aggregate data for different targets and/or temperatures when analyzing isothermal titration calorimetry data in this manner). But even if F2L affinity gains were shown generally to come more from entropy than enthalpy, would that be a strong rationale for screening fragments?

This gets us onto molecular complexity and this article by Mike Hann and GSK colleagues should be considered essential reading for anybody thinking about selection of compounds for screening. The Hann model is a conceptual framework for molecular complexity but it doesn’t provide much practical guidance as to how to measure complexity (this is not a criticism since the thought process should be more about frameworks and less about metrics). I don’t believe that it will prove possible to quantify molecular complexity in an objective manner that is useful for designing compound libraries (I will be delighted to be proven wrong on this point). The approach to handling molecular complexity that I’ve used in screening library design is to restrict extent of substitution (and other substructural features that can be considered to be associated with molecular complexity) and this is closer to ‘needle screening’ as described by Roche scientists in 2000 than to the Hann model.

Had I voted in the poll, ‘low molecular complexity’ would have got my vote.  Here’s what I said in NoLE (it’s got an entire section on fragment-based design and a practical suggestion for redefining ligand efficiency so that perception does not change with C°):

"I would argue that the rationale for screening fragments against targets of interest is actually based on two conjectures. First, chemical space can be covered most effectively by fragments because compounds of low molecular complexity [18, 21, 22] allow TIP [target interaction potential] to be explored [70,71,72,73,74] more efficiently and accurately. Second, a fragment that has been observed to bind to a target may be a better starting point for design than a higher affinity ligand whose greater molecular complexity prevents it from presenting molecular recognition elements to the target in an optimal manner."

To be fair, those who advocate the use of LE and thermodynamic signatures in fragment-based design do not deny the importance of molecular complexity. Let’s assume for the sake of argument that interaction quality can actually be defined and is quantified by the LE value and/or the thermodynamic signature for binding of compound to target. While these are massive assumptions, LE values and thermodynamic signatures are still effects rather than causes.

The last option for the poll was ‘God loves fragments’ and more respondents (33.8%) voted for this than any of the first three options. I would interpret a vote for ‘God loves fragments’ in three ways. First, the respondent doesn’t consider any one of the first three options to be a stronger rationale for screening fragments than the other two. Second, the respondent doesn’t consider any of the first three options to be a valid rationale for screening fragments. Third, the respondent considers fragment-based approaches to have been oversold.

This is a good place to wrap up. While I remain an enthusiast for fragment-based approaches to lead discovery, I do also believe that they have been somewhat oversold. The sensitivity of LE evangelists to criticism of their metric may stem from the use of LE to sell fragment-based methods to venture capitalists and, internally, to skeptical management. A shared (and serious) deficiency in the conventional ways in which LE and thermodynamic signature are quantified is that perception changes when the arbitrary concentration,  C°, that defines the standard state is changed. While there are ways in which this deficiency can be addressed for analysis, it is important that the deficiency be acknowledged if we are to move forward. Drug design is difficult and if we, as drug designers, embrace shaky science and flawed data analysis then those who fund our activities may conclude that the difficulties that we face are of our own making.     
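To make the point about C° concrete, here is a minimal Python sketch. It assumes the conventional definition of LE as −∆G°/(number of heavy atoms) with ∆G° = RT ln(Kd/C°); the two compounds, their Kd values, their heavy atom counts and the temperature are all hypothetical and chosen purely for illustration.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def ligand_efficiency(kd_molar, heavy_atoms, c_std_molar=1.0, temp_k=298.15):
    """Conventional LE in kcal/mol per heavy atom: -dG0 / N, with dG0 = RT ln(Kd / C0)."""
    dg0 = R_KCAL * temp_k * math.log(kd_molar / c_std_molar)
    return -dg0 / heavy_atoms


# Hypothetical compounds: (Kd in mol/L, number of heavy atoms)
fragment = (1e-3, 10)
lead = (1e-8, 30)

# Compare the two compounds at the usual 1 M standard state and at 1 micromolar
for c_std in (1.0, 1e-6):
    le_fragment = ligand_efficiency(*fragment, c_std_molar=c_std)
    le_lead = ligand_efficiency(*lead, c_std_molar=c_std)
    print(f"C0 = {c_std:g} M: LE(fragment) = {le_fragment:.2f}, LE(lead) = {le_lead:.2f}")
```

With these made-up numbers the fragment looks more efficient than the lead when C° is 1 M, but the order reverses when C° is set to 1 µM, even though neither Kd value has changed.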

Saturday, 18 July 2020

SARS-CoV-2 main protease. Crowdsourcing, peptidomimetics and fragments


“Just take the ball and throw it where you want to. Throw strikes. Home plate don’t move.”

Satchel Paige (1906-1982) 

The COVID Moonshot and OSC19 are examples of what are sometimes called crowdsourced or open source approaches to drug discovery. While I’m not particularly keen on the use of the term ‘open source’ in this context, I have absolutely no quibble with the goal of seeking cures and treatments for diseases that are ignored by commercial drug discovery organizations. Open source drug discovery originated with OSDD in India and it should be noted that the approach has also been pioneered for malaria by OSM.  I see crowdsourcing primarily as a different way to organize and resource drug discovery rather than as a radically different way to do drug discovery.

One point that’s not always appreciated by cheminformaticians, computational chemists and drug discovery scientists in academia is that there’s a bit more to drug discovery than making predictions. In particular, I advise those seeking to transform drug discovery to ensure that they actually know what a drug needs to do and understand the constraints under which drug discovery scientists work. Currently, it does not appear to be possible to predict the effects of compounds in live humans from molecular structure with the accuracy needed for prediction-driven design and this is the primary reason that drug discovery is incremental in nature. A big part of drug discovery is generation of the information needed in order to maintain progress and there are gains to be had by doing this as efficiently as possible. Efficient generation of information, in turn, requires a degree of coordination that may prove difficult to achieve in a crowdsourced project.

The SARS-CoV-2 main protease (Mpro) is one of a number of potential targets of interest in the search for COVID-19 therapies. Like the cathepsins that are (or, at least, have been) of interest to the pharma/biotech industry as potential targets for therapeutic intervention, Mpro is a cysteine protease. If I’d been charged with quickly delivering an inhibitor of Mpro as a candidate drug then I’d be taking a very close look at how the pharma/biotech industry has pursued cysteine protease targets. Balacatib, odanacatib (cathepsin K inhibitors) and petesicatib (cathepsin S inhibitor) can each be described as a peptidomimetic with a warhead (nitrile) that forms a covalent bond reversibly with the catalytic cysteine.

A number of peptidomimetic Mpro inhibitors have been described in the literature and this blog post by Chris Southan may be of interest. I’ve been looking at the published inhibitors shown below in Chart 1 (which exhibit antiviral activity and have been subjected to pharmacokinetic and toxicological evaluation) and have written some notes on mapping the structure-activity relationship for compounds like these. I should stress that compounds discussed in these notes are not expected to be dramatically more potent than the two shown in Chart 1 (in fact, I expect at least one to be significantly less potent). Nevertheless, I would argue that assay results for these proposed synthetic targets would inform design.

My assessment of these compounds is that there is significant room for improvement and I think that it would be relatively easy to achieve a pIC50 of 8 (corresponding to an IC50 of 10 nM) using the aldehyde warhead. I’d consider taking an aldehyde forward (there are options for dosing as a prodrug) although it really would be much better if there was also the option to exchange this warhead for the nitrile (a warhead that is much-loved by industrial medicinal chemists since it’s rugged, polar and contributes minimally to molecular size). While I’d anticipate that replacement of aldehyde with nitrile will lead to a reduction in potency, it’s necessary to quantify the potency loss to enable the potential of nitriles to be properly assessed. The binding mode observed for 1 is shown below in Figure 1 and it’s likely that the groove region will need to be more fully exploited (this article will give you an idea of the sort of thing I have in mind) in order to achieve acceptable potency if the aldehyde warhead is replaced by nitrile.
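For readers less used to the logarithmic scale, the correspondence between pIC50 and IC50 quoted above can be checked with a couple of lines of Python. This is only a convenience sketch; the function names are my own and the IC50 is assumed to be expressed in mol/L.

```python
import math


def pic50_from_ic50(ic50_molar):
    """pIC50 is the negative base-10 logarithm of IC50 expressed in mol/L."""
    return -math.log10(ic50_molar)


def ic50_from_pic50(pic50):
    """Inverse conversion: IC50 in mol/L from pIC50."""
    return 10.0 ** (-pic50)


# A pIC50 of 8 corresponds to an IC50 of 1e-8 mol/L, which is 10 nM
print(pic50_from_ic50(10e-9))
print(ic50_from_pic50(8.0) * 1e9, "nM")
```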

The COVID Moonshot project currently appears to be in what many industrial drug discovery scientists would call the hit-to-lead phase. In my view the principal objective of hit-to-lead work is to create options since having options will give the lead optimization team room to manoeuvre (you can think of hit-to-lead work as being a bit like playing in midfield). The COVID Moonshot project is currently focused on exploitation of hits from a fragment screen against Mpro and, while I’d question whether this approach is likely to get to a candidate drug more quickly than the conventional structure-based design used in industry to pursue cathepsins, it’s certainly an interesting project that I’m happy to contribute to. It’s also worth mentioning that fragment screens have been run against SARS-CoV-2 Nsp3 macrodomain at UCSF and Diamond since there are no known inhibitors for this target.

Here’s a blog post by Pat Walters in which he examines the structure-activity relationships emerging for the fragment-derived inhibitors. Specifically, he uses a metric known as the Structure-Activity Landscape Index (SALI) to quantify the sensitivity of activity to structural changes. Medicinal chemists apply the term ‘activity cliff’ to situations where a small change in structure results in a large change in activity and I’ve argued that the idea of quantifying the sensitivity of a physicochemical effect to structural modifications goes all the way back to Hammett. One point that comes out of Pat’s post is that it’s difficult to establish structure-activity relationships for low affinity ligands with a conventional biochemical assay. When applying fragment-based approaches in lead discovery, there are distinct advantages to being able to measure low binding affinity (~ 1 mM) since this allows fragment-based structure-activity relationships to be explored prior to synthetic elaboration of fragment hits. As Pat notes, inadequate solubility in assay buffer clearly places limits on the affinity that can be reliably measured in any assay although interference with the readout of a biochemical assay can also lead to misleading results. This is one reason that biophysical detection of binding using methods such as surface plasmon resonance (SPR) is favored in fragment-based lead discovery. Here’s an article by some of my former colleagues which shows how you can assess the impact of interference with the readout of a biochemical assay (and even correct for it if the effect isn’t too great).
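As a rough illustration of what SALI measures, here is a minimal Python sketch based on the commonly quoted definition, SALI = |Ai − Aj| / (1 − sim(i, j)), where A is an activity such as pIC50 and sim is a similarity between 0 and 1 (typically a fingerprint Tanimoto coefficient). The similarity is passed in as a plain number here, the activity values are hypothetical, and I am not claiming that this matches the details of Pat's implementation.

```python
import math


def sali(activity_i, activity_j, similarity):
    """Activity difference for a pair of compounds, scaled by their structural dissimilarity."""
    if not 0.0 <= similarity <= 1.0:
        raise ValueError("similarity must lie between 0 and 1")
    difference = abs(activity_i - activity_j)
    if similarity == 1.0:
        # Convention chosen for this sketch: indistinguishable structures with
        # different activities are treated as an infinitely steep cliff.
        return math.inf if difference > 0.0 else 0.0
    return difference / (1.0 - similarity)


# Hypothetical pairs with the same 2 log unit difference in pIC50
print(sali(7.0, 5.0, 0.9))  # very similar structures: large SALI (activity cliff)
print(sali(7.0, 5.0, 0.5))  # less similar structures: smaller SALI
```

The same activity difference gives a much larger value when the two structures are very similar, which is what flags the pair as an activity cliff.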

My first contribution to the COVID Moonshot project is illustrated in Chart 2 and the fragment-derived inhibitor 3 from which I started is also featured in Pat’s post. From inspection of the crystal structure, I noticed that the catalytic cysteine might be targeted by linking a ‘reversible’ warhead from the amide nitrogen (4 and 5). Although this might look fine on paper, the experimental data in this article suggest that linking any saturated carbon to the amide nitrogen will bias the preferred amide geometry away from trans to cis. Provided that the intrinsic gain in affinity resulting from linking the warhead is greater than the cost of adopting the bound conformation, the structural modification will lead to a net increase in affinity and the structures could be locked (here's an article that shows how this can work) into the bound conformation (e.g. by forming a ring).


In addition to being accessible to a warhead linked from the amide nitrogen of 3, the catalytic cysteine is also within striking distance of the carbonyl carbon and it would be prudent to consider the possibility that 3 and its analogs can function as substrates for Mpro. There is precedent for this type of behavior and I’ll point you toward an article that notes that a series of esters identified as cruzain inhibitors can function as substrates and a more recent article that presents cruzain inhibitors that I’d consider to be potential substrates. A crystal structure of the protein-ligand complex is potentially misleading in this context since the enzyme might not be catalytically active. I believe that 6 could be used to explore this possibility since the carbonyl carbon would be expected to be more electrophilic and 3-hydroxy-4-methylpyridine would be expected to be a better leaving group than its 3-amino analog.

This is a good point to wrap things up. I think that Satchel Paige gave us some pretty good advice on how to approach drug discovery and that's yet another reason that Black Lives Matter.

Wednesday, 27 May 2020

COVID-19 stuff


It’s been ages since the last blog post. I’d been thinking of marking my return with an April Fools post but this didn’t seem right given the seriousness of the COVID-19 pandemic. However, I do realize that many people only follow the blog for the April Fools posts so I’ll link them here for easy reference [2013 | 2015 | 2016 | 2017 | 2018 | 2019]. I’m currently in Trinidad so I’ll share a photo from Berwick-on-Sea, on Trinidad's north coast (and the correspondence address for two [ K2017 | K2019 ] of my more controversial articles).


I should say at the outset that I’ve never previously worked in the antiviral area nor tried to help fight a global pandemic. X-ray crystal structures had been published for the main protease of SARS-CoV-2 back in March and these generated some discussion on Twitter with Martin Stoermer and Ash Jogalekar (who actually triggered it). The upshot of the discussion was that a hydrogen bond between protein and ligand appeared to be of suboptimal geometry. Martin and I wrote a short article which we uploaded to figshare and Martin also did a blog post. I’ve decided to post my contributions to the COVID-19 response on figshare rather than cluttering ChemRxiv and bioRxiv with preprints that I have no intention of ever submitting to a journal. I should point out that the main protease is just one of a number of SARS-CoV-2 targets that one might exploit and I’ll direct you to this helpful review.

The two inhibitors that Martin and I wrote about are both peptidomimetics and each inhibitor structure incorporates a warhead which can form a covalent bond with the catalytic cysteine sulfur. I was particularly interested in the inhibitor with the α-ketoamide warhead because the inhibition would be expected to be reversible (always a good idea to check though) and I’ll get on to why that’s significant a bit later in the post. When I examine a crystal structure, I first look for what, out of laziness, I’ll call ‘weaknesses’ in the binding mode. These ‘weaknesses’ can be local as is the case for contact between polar and non-polar regions of molecular surface or a hydrogen bond with less than ideal geometry. However, ‘weaknesses’ can also be non-local when a ligand binds in a form (protonation state, tautomer, conformer) that is relatively high in energy. Generally, ‘weaknesses’ in binding modes should always be seen as design opportunities, especially when they are non-local, and here’s an example of how recognition of instability of the bound conformation was used in fragment-based design of PTP1B inhibitors.

It can be helpful to think in terms of design themes when optimizing both hits and leads. Typically, there is insufficient data for building useful predictive models at the start of a project and the optimization process involves efficient generation of the information required for making decisions. As such, optimization of both hits and leads should be seen in a Design of Experiments framework. After seeking insights from BB (my mother's dog), I wrote up some design themes.


A crystallographic fragment screen has been run against the SARS-CoV-2 main protease and a number of electrophilic fragments were screened using mass spectrometry. These two screens serve as a launch pad for the COVID Moonshot which looks interesting (although I’d suggest easing off a bit on the propaganda). One limitation of crystallographic fragment screening is that it is very difficult to measure the affinity of fragments which means that it is not generally feasible to explore the structure-activity relationships of fragments prior to structural elaboration. That said, it’s not impossible and I’ll point you to this article which reports a value of -3.1 kcal/mol for the free energy of binding of pyrazole to protein kinase B that was derived from the concentration response of occupancy. The results of the crystallographic screen also have implications for the design of peptidomimetic inhibitors (in particular, the results point to pyridine as a bioisostere for the pyrrolidinone that is commonly used as a P1 substituent) and these notes may be helpful.
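To give a feel for what a binding free energy of -3.1 kcal/mol means in concentration terms, here is a minimal Python sketch that converts ∆G° to a dissociation constant using Kd = C° exp(∆G°/RT) and then computes occupancy for a simple single-site model, occupancy = [L]/([L] + Kd). The temperature (298.15 K), the 1 M standard state and the ligand concentrations are my assumptions for illustration and are not taken from the article.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def kd_from_dg(dg_kcal, temp_k=298.15, c_std_molar=1.0):
    """Dissociation constant in mol/L from the standard free energy of binding."""
    return c_std_molar * math.exp(dg_kcal / (R_KCAL * temp_k))


def occupancy(ligand_molar, kd_molar):
    """Fractional occupancy of a single site, assuming free ligand is in large excess."""
    return ligand_molar / (ligand_molar + kd_molar)


kd = kd_from_dg(-3.1)
print(f"Kd = {kd * 1e3:.1f} mM")

# Hypothetical soaking concentrations in mol/L
for ligand in (1e-3, 1e-2, 1e-1):
    print(f"[L] = {ligand * 1e3:g} mM: occupancy = {occupancy(ligand, kd):.2f}")
```

Under these assumptions the Kd comes out in the low millimolar range, so the occupancy changes appreciably over the concentrations that are typically accessible in a soaking experiment.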

Reversibility is an issue that you definitely need to be aware of when designing compounds to inhibit cysteine proteases and these notes may be helpful. The issue arises because formation of a covalent bond between an electrophilic center (commonly referred to as a ‘warhead’) and the thiol of the catalytic cysteine is a commonly used tactic in inhibitor design. I'll direct you to a review of covalent drugs, an article that discusses some of the things that you need to consider when working with covalent inhibitors and a blog post on approved covalent drug mechanisms. There does appear to be a degree of prejudice [R1997 | BH2010 | BW2014] against covalent inhibition and some even appear to be unaware that covalent inhibition can be reversible.

If designing covalent cysteine protease inhibitors, I would generally favor reversible inhibition over irreversible inhibition. My primary reason for taking this view is that design of reversible inhibitors is less complex because IC50 can be interpreted in terms of affinity and you can use pretty much the same structure-based approaches as you would for non-covalent inhibitors. You can't really interpret IC50 for an irreversible inhibitor in the same way because the enzyme will be 100% inhibited if it's in contact with an irreversible inhibitor for long enough. The inhibitory activity of irreversible inhibitors is typically quantified by the ratio of the inactivation rate constant (kinact) to the inhibition constant (Ki) which makes the enzyme inhibition assay more complex for irreversible inhibitors. Furthermore, you'll need to build transition state models in order to do structure-based design.
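The time dependence of irreversible inhibition can be sketched with the standard two-step model in which the observed inactivation rate is kobs = kinact[I]/(Ki + [I]) and the fraction of active enzyme decays as exp(-kobs t). The Python below is a minimal illustration under simplifying assumptions (inhibitor in large excess over enzyme, no competing substrate) and the kinact and Ki values are hypothetical.

```python
import math


def k_obs(inhibitor_molar, kinact, ki_molar):
    """Observed first-order inactivation rate constant (1/s)."""
    return kinact * inhibitor_molar / (ki_molar + inhibitor_molar)


def fraction_active(inhibitor_molar, time_s, kinact, ki_molar):
    """Fraction of enzyme still active after incubation for time_s seconds."""
    return math.exp(-k_obs(inhibitor_molar, kinact, ki_molar) * time_s)


def apparent_ic50(time_s, kinact, ki_molar):
    """Inhibitor concentration that inactivates half of the enzyme by time_s."""
    k_half = math.log(2.0) / time_s
    if k_half >= kinact:
        return math.inf  # 50% inactivation cannot be reached this quickly
    return ki_molar * k_half / (kinact - k_half)


KINACT = 1e-3  # hypothetical inactivation rate constant in 1/s
KI = 1e-6      # hypothetical inhibition constant in mol/L

# The apparent IC50 keeps falling as the incubation time is extended
for time_s in (600.0, 3600.0, 36000.0):
    print(f"t = {time_s:g} s: apparent IC50 = {apparent_ic50(time_s, KINACT, KI):.3g} M")
```

Because the apparent IC50 depends on how long enzyme and inhibitor have been in contact, a single IC50 value says little about an irreversible inhibitor unless the incubation time is also specified, which is why kinact/Ki is the more useful quantity.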

It is possible that irreversible inhibition could lead to a longer duration of action although you also need to consider the consequences of slow inactivation of the enzyme. If thinking along these lines, you should look at this article by Rutger Folmer. Generally, the decision to go for reversible or irreversible inhibitors is one that drug discovery teams should think through carefully and the decision should determine screening tactics (rather than vice versa).