X-ray screening identifies active site and allosteric inhibitors of SARS-CoV-2 main protease

See allHide authors and affiliations

Science  02 Apr 2021:
DOI: 10.1126/science.abf7945


The coronavirus disease (COVID-19) caused by SARS-CoV-2 is creating tremendous human suffering. To date, no effective drug is available to directly treat the disease. In a search for a drug against COVID-19, we have performed a high-throughput X-ray crystallographic screen of two repurposing drug libraries against the SARS-CoV-2 main protease (Mpro), which is essential for viral replication. In contrast to commonly applied X-ray fragment screening experiments with molecules of low complexity, our screen tested already approved drugs and drugs in clinical trials. From the three-dimensional protein structures, we identified 37 compounds that bind to Mpro. In subsequent cell-based viral reduction assays, one peptidomimetic and six non-peptidic compounds showed antiviral activity at non-toxic concentrations. We identified two allosteric binding sites representing attractive targets for drug development against SARS-CoV-2.

Infection of host cells by SARS-CoV-2 is governed by the complex interplay of molecular factors from both the host and the virus (1, 2). Coronaviruses are RNA-viruses with a genome of approximately 30,000 nucleotides. The viral open-reading frames are expressed as two overlapping large polyproteins, which must be separated into functional subunits for replication and transcription activity (1). This proteolytic cleavage is primarily accomplished by the main protease (Mpro), also known as 3C-like protease 3CLpro or nsp5. Mpro cleaves the viral polyprotein pp1ab at eleven distinct sites. The core cleavage motif is Leu-Gln↓(Ser/Ala/Gly) (1). Mpro possesses a chymotrypsin-like fold appended with a C-terminal helical domain, and harbors a catalytic dyad comprised of Cys145 and His41 in its active site, which is formed by four major pockets that are labeled according to their position relative to the scissile bond of the substrate (Fig. 1) (1). The active site is located in a cleft between the two N-terminal domains of the three-domain structure of the monomer, while the C-terminal helical domain is involved in regulation and dimerization of the enzyme (Fig. 1A). Due to its central involvement in virus replication, Mpro is recognized as a prime target for antiviral drug discovery and compound screening activities aiming to identify and optimize drugs which can tackle coronavirus infections (3). Indeed, a number of recent publications confirm the potential of targeting Mpro for inhibition of virus replication (1, 2, 4).

Fig. 1 X-ray screening of drug-repurposing libraries reveals compound binding sites distributed across the complete Mpro surface.

(A) Schematic drawing of Mpro dimer structure. Protomer A in white, protomer B in red. For clarity, the 29 binding compounds (yellow sticks) are only depicted on one of the two protomers. Catalytic residues H41 and Cys145, active site and two allosteric drug binding sites are highlighted. (B) Close-up view of active site with peptide substrate bound (blue sticks), modeled after SARS-CoV Mpro (PDB 2Q6G). Scissile bond is indicated in yellow and with green arrow. Substrate binding pockets S1’, S1, S2 and S4 are indicated by colors.

In order to find drug candidates against SARS-CoV-2, we performed a large-scale X-ray crystallographic screen of Mpro against two repurposing libraries containing 5953 unique compounds from the “Fraunhofer IME Repurposing Collection” and the “Safe-in-man” library from Dompé Farmaceutici S.p.A. (5).

In contrast to crystallographic fragment-screening experiments, repurposing libraries are chemically more complex (fig. S1A) (6, 7). Thus they likely bind more specifically and with higher affinity (8). Due to the higher molecular weights, we performed co-crystallization experiments at a physiological pH-value of 7.5 instead of compound soaking into native crystals (9).

From the 5953 unique compounds in our screen, we obtained X-ray diffraction datasets from 2381 unique compounds, which were subjected to automated structure refinement followed by cluster analysis (10) and pan dataset density analysis (PanDDA) (11) (table S1). We observed additional electron-density, indicating binding to Mpro, for 43 compounds, which were classified as hits, representing 37 unique compounds (tables S1, S2, and S3). From these, the binding mode could be unambiguously determined for 29 molecules (Fig. 1A and table S4). The majority of hits were found in the active site of the enzyme. Of the 16 active-site binders, six covalently bind as thioethers to Cys145, one compound binds covalently as a thiohemiacetal to Cys145, one is zinc-coordinated and eight bind non-covalently. The remaining 13 compounds bind outside the active site at various locations (Fig. 1A).

Of the 43 hits from our X-ray screen, 37 compounds were available in quantities required for testing their antiviral activity against SARS-CoV-2 in cell assays (table S2). Nine compounds, that reduced viral RNA (vRNA) replication by at least two orders of magnitude in Vero E6 cells (fig. S2), were further evaluated to determine the effective concentrations that reduced not only vRNA but also SARS-CoV-2 infectious particles by 50% (EC50) (Fig. 2). Additionally, AT7519 and ifenprodil, which showed slightly lower vRNA-level reduction, were included due to their distinct binding sites outside of the active site. From these eleven, seven compounds (AT7519, calpeptin, ifenprodil, MUT056399, pelitinib, tolperisone, triglycidyl isocyanurate) exhibited at least one hundredfold reduction in infectious particles in combination with either selectivity indices (SI = CC50 / EC50) greater than five or no cytotoxicity in the tested concentration range and are considered antivirally active (table S5).

Fig. 2 Effect of selected compounds on SARS-CoV-2 replication in Vero E6 cells.

The vRNA yield (solid circles), viral titers (half-solid circles), and cell viability (empty circles) were determined by RT-qPCR, immunofocus assays, and the CCK-8 method, respectively. EC50 for the viral titer reduction is shown. Individual data points represent mean ± SD from three independent replicates in one experiment.

In the following we focus on a more detailed description of the eleven compounds analyzed in the secondary screen, which are grouped according to their different binding sites. The remaining hits are described in the supplementary text and figs. S3 to S5.

Tolperisone, HEAT and isofloxythepin bind covalently to the active site. Tolperisone is antivirally active (EC50 = 19.17 μM) and shows no cytotoxicity (CC50 > 100 μM) (Fig. 2), whereas HEAT (EC50 = 24.05 μM, CC50 = 55.42 μM) and isofloxythepin (EC50 = 4.8 μM, CC50 = 17 μM) show unfavorable cytotoxicity. For all three compounds, only breakdown products are observed in the active site. Tolperisone and HEAT are β-aminoketones, but we only observe the part of the drug containing the ketone (2,4'-dimethylpropiophenone and 2-methyl-1-tetralone), while the remaining part with the amine group is missing. The breakdown product binds as a Michael acceptor to the thiol of Cys145, independently confirmed for HEAT by mass spectrometry (fig. S6 and table S6). The decomposition of tolperisone and HEAT was detected in both the crystallization and cell culture conditions (fig. S7) and is reported to be pH-dependent (12). The parent compounds can be regarded as pro-drugs (13, 14). In the X-ray structures the aromatic ring systems of tolperisone (Fig. 3A) and HEAT (Fig. 3B) protrude into the S1 pocket and form van der Waals contacts with the backbone of Phe140 and Leu141 and the side chain of Glu166. In addition, the keto group accepts a hydrogen bond from the imidazole side chain of His163. Tolperisone is used as a skeletal muscle relaxant (15). The X-ray structure suggests that isofloxythepin binds similarly as a fragment to Cys145 (Fig. 3C).

Fig. 3 Covalent and non-covalent binders in the active site of Mpro.

Bound compounds are depicted as colored sticks while the surface of Mpro is shown in grey with selected interacting residues as sticks. Substrate binding pockets are colored as in Fig. 1. Hydrogen bonds are depicted by dashed lines. (A) tolperisone. (B) HEAT, (C) isofloxythepin, (D) triglycidyl isocyanurate, (E) calpeptin, (F) MUT056399.

Triglycidyl isocyanurate has antiviral activity (EC50 = 30.02 μM, CC50 > 100 μM) and adopts a covalent and non-covalent binding mode to the active site. In both modes, the compound’s central ring sits on top of the catalytic dyad (His41, Cys145) and its three epoxypropyl substituents reach into subsites S1’, S1 and S2. The non-covalent binding mode is stabilized by hydrogen bonds to the main chain of Gly143 and Gln166, and to the side chain of His163. In the covalently bound form, one oxirane ring is opened by nucleophilic attack of Cys145 forming a thioether (Fig. 3D). Triglycidyl isocyanurate has been tested as an antitumor agent (16).

Calpeptin shows the highest antiviral activity in the screen (EC50 = 72 nM, CC50 > 100 μM). It binds covalently via its aldehyde group to Cys145, forming a thiohemiacetal. This peptidomimetic inhibitor occupies substrate pockets S1 to S3, similar to the peptidomimetic inhibitors GC-376 (17, 18), calpain inhibitors (19), N3 (2), and the α-ketoamide 13b (1). The peptidomimetic backbone forms hydrogen bonds to the main chain of His164 and Glu166, whereas the norleucine side chain maintains van der Waals contacts with the backbone of Phe140, Leu141 and Asn142 (Fig. 3E). Calpeptin has known activity against SARS-CoV-2 Mpro in enzymatic assays (17). The structure is highly similar to the common protease inhibitor leupeptin (fig. S3A), which served as a positive control in our X-ray screen but was not further tested in antiviral assays. In silico docking experiments also suggested calpeptin as a possible Mpro binding molecule (table S7). Calpeptin also inhibits cathepsin L (20) and dual targeting of cathepsin L and Mpro is suggested as attractive path for SARS-CoV-2 inhibition (19).

MUT056399 binds non-covalently to the active site (EC50 = 38.24 μM, CC50 > 100 μM). The diphenyl ether core of MUT056399 blocks access to the catalytic site consisting of Cys145 and His41. The terminal carboxamide group occupies pocket S1 and forms hydrogen bonds to the side chain of His163 and the backbone of Phe140 (Fig. 3F). The ethyl-phenyl group of the molecule reaches deep into pocket S2, which is enlarged by a shift of the side chain of Met49 out of the substrate binding pocket. MUT056399 was developed as an antibacterial agent against multidrug-resistant Staphylococcus aureus strains (21).

Quipazine maleate showed moderate antiviral activity (EC50 = 31.64 μM, CC50 > 100 μM). In the X-ray structure, only the maleate counterion is observed covalently bound as a thioether (supplementary text and fig. S3B). Maleate is observed in structures of six other compounds showing no antiviral activity. The observed antiviral activity is thus likely caused by an off-target effect of quipazine.

In general, the enzymatic activity of Mpro relies on the architecture of the active site, which critically depends on the dimerization of the enzyme and the correct relative orientation of the subdomains. This could allow ligands that bind outside of the active site to affect activity. In fact, we discovered two such allosteric binding sites of Mpro.

Five compounds of our X-ray screen bind in a hydrophobic pocket in the C-terminal dimerization domain (Fig. 4, A and B), located close to the oxyanion hole in pocket S1 of the substrate-binding site. One of these showed strong antiviral activity (Fig. 2). Another compound binds in between the catalytic and dimerization domains of Mpro.

Fig. 4 Screening hits at allosteric sites of Mpro.

(A) Close up view of the binding site in the dimerization domain (protomer A, grey cartoon representation), close to the active site of the second protomer (protomer B, surface representation) in the native dimer. Residues forming the hydrophobic pocket are indicated. Pelitinib (dark green) binds to the C-terminal α-helix at Ser301 and pushes against Asn142 and the β-turn of the pocket S1 of protomer B (residues marked with an asterisk). The inset shows conformational change of Gln256 (grey sticks) compared to Mpro apo structure (white sticks). (B) RS-102895 (purple), ifenprodil (cyan), PD-168568 (orange) and tofogliflozin (blue) occupy the same binding pocket as pelitinib. (C) AT7519 occupies a deep cleft between the catalytic and dimerization domain of Mpro. (D) Conformational changes in the AT7519 bound Mpro structure (grey) compared to the apo structure (white).

Central to the first allosteric binding site is a hydrophobic pocket formed by Ile213, Leu253, Gln256, Val297 and Cys300 within the C-terminal dimerization domain (Fig. 4A). Pelitinib, ifenprodil, RS-102895, PD-168568 and tofogliflozin all exploit this site by inserting an aromatic moiety into this pocket.

Pelitinib shows the second highest antiviral activity in our screen (EC50 = 1.25 μM, CC50 = 13.96 μM). Its halogenated benzene ring binds to the hydrophobic groove in the helical domain which becomes accessible by movement of the Gln256 side chain (Fig. 4A). The central 3-cyanoquinoline moiety interacts with the end of the C-terminal helix (Ser301). The ethyl ether substituent pushes against Tyr118 and Asn142 (from loop 141-144 of the S1 pocket) of the opposing protomer within the native dimer. The integrity of this pocket is crucial for enzyme activity (22). Pelitinib is an amine-catalyzed Michael acceptor (23), developed as an anticancer agent to bind to a cysteine in the active site of the tyrosine kinase epidermal growth factor receptor inhibitor (24). But from its observed binding position it is impossible for it to reach into the active site and no evidence for covalent binding to Cys145 is found in the electron-density maps.

Ifenprodil and RS-102895 bind to the same hydrophobic pocket in the dimerization domain as pelitinib (Fig. 4B; fig. S4, A and B; and supplementary text). Only ifenprodil (EC50 = 46.86 μM, CC50 > 100 μM) shows moderate activity. RS-102895 (EC50 = 19.8 μM, CC50 = 54.98 μM) interacts, similar to pelitinib, with the second protomer by forming two hydrogen bonds to the side and main chains of Asn142 while the other compounds exhibit weaker or no interaction with the second protomer. PD-168568 and tofogliflozin bind the same site but are inactive (Fig. 4B and fig. S4, C and D).

The second allosteric site is formed by the deep groove between the catalytic domains and the dimerization domain. AT7519 is the only compound in our screen that we identified bound to this site (Fig. 4C). Though it has only moderate activity, we discuss it here because this site may be a target. The chlorinated benzene ring is engaged in various van der Waals interactions to loop 107-110, Val202, and Thr292. The central pyrazole has van der Waals contacts to Ile249, Phe294 and its adjacent carbonyl group forms a hydrogen bond to the side chain of Gln110. The terminal piperidine sits on top of Asn151 and forms hydrogen bonds to the carboxylate of Asp153. This results in a displacement of loop 153-155, slightly narrowing the binding groove. The Cα-atom of Tyr154 moves by 2.8 Å, accompanied by a conformational change of Asp153 (Fig. 4D). This allows hydrogen bonding to the compound and the formation of a salt-bridge to Arg298. Arg298 is crucial for dimerization (25). The mutation Arg298Ala causes a reorientation of the dimerization domain relative to catalytic domain, leading to changes in the oxyanion hole and destabilization of the S1 pocket by the N terminus. AT7519 was evaluated for treatment of human cancers (26). The potential of allosteric inhibition of Mpro through modulation of Arg298 has been independently demonstrated by mass spectrometry (27).

Our X-ray screen revealed 43 compounds binding to Mpro, with seven compounds showing antiviral activity against SARS-CoV-2. We present structural evidence for interaction of these compounds at active and allosteric sites of Mpro, although we may not exclude that off-target effects played a role in the antiviral effect in cell culture, in particular for compounds with low selectivity index. Vice versa, missing antiviral activity of compounds binding clearly to Mpro in the crystal might be due to rapid metabolization in the cellular environment. Calpeptin and pelitinib showed strong antiviral activity with low cytotoxicity and are suitable for preclinical evaluation. In any case all hit compounds are valuable lead structures with potential for further drug development, especially since drug-repurposing libraries offer the advantage of proven bio-activity and cell-permeability (28).

The most active compound, calpeptin binds in the active site similar to other members of the large class of peptide-based inhibitors that bind as thiohemi-acetals or -ketals to Mpro (29). In addition to this peptidomimetic inhibitor, we discovered several non-peptidic inhibitors. Those compounds binding to the active site of Mpro contained new Michael acceptors based on β-aminoketones (tolperisone and HEAT). These lead to the formation of thioethers and have not been described as prodrugs for viral proteases. We also identified a non-covalent binder, MUT056399, blocking the active site. Besides this common active-site inhibition, we discovered compounds that inhibit the enzyme through binding at two allosteric sites of Mpro.

The first allosteric site (dimerization domain) is in direct vicinity of the S1 pocket of the adjacent monomer within the native dimer. The potential for antiviral inhibition through this site is demonstrated by pelitinib. The hydrophobic nature of the residues forming the main pocket is conserved in all human coronavirus Mpro (fig. S8). Consequently, potential drugs targeting this binding site may be effective against other coronaviruses. The potential of the second allosteric site as a druggable target is demonstrated by the observed moderate antiviral activity of AT7519.

Supplementary Materials

Materials and Methods

Supplementary Text

Figs. S1 to S9

Tables S1 to S7

References (3054)

MDAR Reproducibility Checklist

This is an open-access article distributed under the terms of the Creative Commons Attribution license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

References and Notes

Acknowledgments: We acknowledge Deutsches Elektronen-Synchrotron (DESY, Hamburg, Germany), a member of the Helmholtz Association HGF, for the provision of experimental facilities. Parts of this research were carried out at PETRA III at beamline P11. Further MX data were collected at beamline P13 and P14 operated by EMBL. We thank the DESY machine group, in particular Mario Wunderlich, Kim Heuck, Arne Brinkmann, Olaf Goldbeck, Jürgen Haar, Torsten Schulz, Gunnar Priebe, Maximilian Holz, Björn Lemcke, Klaus Knaack, Oliver Seebauer, Philipp Willanzheimer, Rolf Jonas, and Nicole Engling. We thank Thomas Dietrich, Simon Geile, Filip Guicking, Heshmat Noei, and Tim Pakendorf from DESY, and Bianca Di Fabrizio and Sebastian Kühn from BNITM for assistance. This research was supported in part through the Maxwell computational resources operated at DESY. We acknowledge the use of the XBI biological sample preparation laboratory at European XFEL, enabled by the XBI User Consortium. Funding: We acknowledge financial support from the EXSCALATE4CoV EU-H2020 Emergency Project (101003551), the Cluster of Excellence “Advanced Imaging of Matter” of the Deutsche Forschungsgemeinschaft (DFG) - EXC 2056 - project ID 390715994, the Helmholtz Association Impulse and Networking funds (projects ExNet-0002 and InternLabs-0011 “HIR3X”), the Federal Ministry of Education and Research (BMBF) via projects 05K16GUA, 05K19GU4, 05K20BI1, 05K20FL1, 16GW0277 and 031B0405D), and the Joachim-Herz-Stiftung Hamburg (project Infecto-Physics). CE and MR acknowledge financial support from grant-No. HIDSS-0002 DASHH (Data Science in Hamburg - HELMHOLTZ Graduate School for the Structure of Matter). RC is supported by DFG grants INST 187/621-1 and INST 187/686-1. DT is supported by the Slovenian Research Agency (ARRS; research program P1-0048, Infrastructural program IO-0048). BS was supported by an Exploration Grant from the Boehringer Ingelheim Foundation. The Heinrich Pette Institute, Leibniz Institute for Experimental Virology was supported by the Free and Hanseatic City of Hamburg and the Federal Ministry of Health. CU and BK were supported by EU Horizon 2020 ERC StG-2017 759661, BMBF RTK Struktur 01KI20391, BMBF Visavix 05K16BH1 and the Leibniz Association SAW-2014-HPI-4 grant. Author contributions: SeG, PR, YFG, WB, PG, ARB, RC, DT, AZ, HNC, ARP, CB, AM designed research. SeG, PR, TJL, WH, HNC, ARP, CB, AM wrote manuscript. SeG, PR, JL, FHMK, SM, WB, ID, BS, HGie, BNB, MB, PLX, NW, HA, NU, SF, BAF, MS, HB, JK, GEPM, ARM, FG, VH, PF, MW, ECS, PM, HT, TB participated in sample preparation. PR performed crystallization experiments. SeG, PR, JL, TJL, OY, SS, AT, MGr, HF, FT, MGa, YG, CFL, SA, AP, GB, DVS, GP, TRS, IB, SP performed X-ray data collection. TJL, HGin, DO, OY, LG, MD, TAW, FS, CR, DM, JZD, IK, CS, RS, HUH, DCFM contributed to X-ray data management. SeG, PR, JL, TJL, HGin, FHMK, WE, DO, AH, VS, JH, JM, JB, JW, CF, MSW, AC, DT, WH, AM performed X-ray data analysis. KL, BK, CU, RC performed and analyzed MS experiments. YFG, BEP, StG performed and analyzed antiviral activity assays. PG, BE, MK, MGA, SN, CG, LZ, XS, KK, AU, JL, RH performed and analyzed ligand binding studies and protein activity assays. CE, JPZ, MR performed computational binding studies. Competing interests: MR is stakeholder of BioSolveIT GmbH, licensor of the software HYDE. Data and materials availability: The coordinates and structure factors for all described crystal structures of SARS-CoV-2 Mpro in complex with compounds are deposited in the PDB with accession codes 6YNQ, 6YVF, 7A1U, 7ABU, 7ADW, 7AF0, 7AGA, 7AHA, 7AK4, 7AKU, 7AMJ, 7ANS, 7AOL, 7AP6, 7APH, 7AQE, 7AQI, 7AQJ, 7AR5, 7AR6, 7ARF, 7AVD, 7AWR, 7AWS, 7AWU, 7AWW, 7AX6, 7AXM, 7AXO, 7AY7, 7B83 and 7NEV. Code used in this analysis has been previously published (10). The code for forcing adherence to the Wilson distribution is included in the Vagabond refinement package ( under a GPLv3 license. Compounds from the Fraunhofer IME Repurposing collection were obtained from the Fraunhofer Institute for Molecular Biology and Applied Ecology under a Material Transfer Agreement. Compounds from the Safe-in-man Library were kindly provided by Dompé Farmaceutici S.p.A. Other materials are available from SeG or AM upon request. This work is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. To view a copy of this license, visit This license does not apply to figures/photos/artwork or other content included in the article that is credited to a third party; obtain authorization from the rights holder before using such material.

Stay Connected to Science

Navigate This Article