Enzymatic construction of highly strained carbocycles

See allHide authors and affiliations

Science  06 Apr 2018:
Vol. 360, Issue 6384, pp. 71-75
DOI: 10.1126/science.aar4239

Double rings made with heme

Cyclic organic structures with adjacent three-carbon rings—bicyclobutanes—are useful starting materials for chemical and materials synthesis owing to their extreme ring strain. Constructing these molecules is a challenging task for organic chemists, especially if a single stereoisomer is desired. Chen et al. engineered a heme-containing enzyme to catalyze sequential carbene insertion reactions using an alkyne substrate. Starting with an enzyme that could only catalyze a single carbene insertion, a series of mutations led to variants that catalyzed efficient, stereoselective production of bicyclobutanes. By using a less reactive alkyne substrate and screening more variants with active site mutations, the authors found enzymes that stop at either enantiomer of the intermediate cyclopropene.

Science, this issue p. 71


Small carbocycles are structurally rigid and possess high intrinsic energy due to their ring strain. These features lead to broad applications but also create challenges for their construction. We report the engineering of hemeproteins that catalyze the formation of chiral bicyclobutanes, one of the most strained four-membered systems, via successive carbene addition to unsaturated carbon-carbon bonds. Enzymes that produce cyclopropenes, putative intermediates to the bicyclobutanes, were also identified. These genetically encoded proteins are readily optimized by directed evolution, function in Escherichia coli, and act on structurally diverse substrates with high efficiency and selectivity, providing an effective route to many chiral strained structures. This biotransformation is easily performed at preparative scale, and the resulting strained carbocycles can be derivatized, opening myriad potential applications.

In cyclic organic molecules, ring strain arises from distortions of bond angle and bond length, steric clashes of nonbonded substituents, and other effects (1). The simplest carbocycles, cyclopropanes and cyclobutanes, possess ring strains of 26 to 28 kcal/mol (2). Introducing carbon-carbon multiple bonds or bridges to these small ring systems induces additional strain as well as structural rigidity. For example, cyclopropenes with an endo-cyclic double bond bear a strain of 54 kcal/mol, whereas bicyclo[1.1.0]butanes, folded into puckered structures, distinguish themselves as one of the most strained four-membered systems, with strain of ~66 kcal/mol (fig. S1) (2). These carbocycles are particularly attractive intermediates in chemical and materials synthesis, because they can undergo strain-release transformations to furnish a myriad of useful scaffolds (36). The structural rigidity imparted by strained rings in supramolecular materials can lead to interesting physical properties, such as mechanical stability (7) and high glass transition temperature (8). The intrinsic energy of these strained structures can also be relieved in response to exogenous force, which leads to radical changes in physical properties (e.g., conductivity), a feature highly desirable for stimulus-responsive materials (9, 10).

High ring strain, however, greatly increases the difficulty of synthesis. A commonly used method for preparing bicyclobutanes starts from dibromo-2-(bromomethyl)cyclopropane substructures and uses organolithium reagents for lithium-halogen exchange, followed by nucleophilic substitution under rigorously anhydrous and cryogenic conditions (3). An alternative route relies on the double transfer of a carbene to alkynes, but the few examples in the literature are mostly limited to methylene carbene (1113). Asymmetric bicyclobutane construction is particularly challenging, with multiple chiral centers generated at the same time (14, 15) (fig. S2). Cyclopropene synthesis through enantioselective single-carbene addition to alkynes also requires chiral transition metal catalysts based on rhodium (16, 17), iridium (18), and cobalt (19). Development of a sustainable catalytic system that performs with high efficiency and selectivity under ambient conditions would be a major advance for construction of these useful, highly strained carbocycles.

Enzymes, the catalytic workhorses of biology, are capable of accelerating chemical transformations by orders of magnitude while exhibiting exquisite control over selectivity (20). Although nature synthesizes various cyclopropane-containing products (21), cyclopropene or bicyclobutane fragments are extremely rare (fig. S3) (22, 23). This may be attributed to the lack of biological machinery for synthesizing these motifs and/or the instability of these structures under biological or natural product isolation/purification conditions. Nonetheless, we envisioned that existing enzymes could be repurposed to forge strained carbocycles by taking advantage of their catalytic promiscuity (24, 25) in the presence of non-natural substrates and by using directed evolution to optimize the activity and selectivity of these starting enzymes (26).

In the past several years, we and others have engineered natural hemeproteins to catalyze reactions not known in nature (2732). We hypothesized that carbene transfer to triple bonds with a heme-dependent enzyme might afford highly strained cyclopropene and bicyclobutane structures and might do so enantioselectively. We anticipated several challenges at the outset, especially in chiral bicyclobutane formation, as it involves two sequential carbene additions to the alkyne substrate: (i) The enzyme would need to bind the alkyne in a specific conformation in order to transfer the carbene enantioselectively; (ii) the high-energy cyclopropene intermediate generated by the first carbene addition would need to be accepted and stabilized by the protein; (iii) relative to methylene carbene used previously, a substituted carbene (e.g., with an ester group) might hinder access of the cyclopropene to the iron-carbenoid; and (iv) the protein would also need to exert precise stereocontrol over the second carbene transfer step, regardless of structural differences between the initial alkyne and the cyclopropene intermediate. Despite these challenges, we decided to investigate whether a starting enzyme with this unusual and non-natural activity could be identified, and whether its active site could be engineered to create a suitable environment for substrate binding, intermediate stabilization, and selective product formation.

We first tested whether free heme [with or without bovine serum albumin (BSA)], which is known to catalyze styrene cyclopropanation (27), could transfer carbene to an alkyne. Reactions using ethyl diazoacetate (EDA) and phenylacetylene (1a) as substrates in neutral buffer (M9-N minimal medium, pH 7.4) at room temperature, however, gave no cyclopropene or bicyclobutane product. Next, a panel of hemeproteins—including variants of cytochrome P450, cytochrome P411 (P450 with the axial cysteine ligand replaced by serine), cytochrome c, and globins in the form of E. coli whole-cell catalysts—were tested for the desired transformation under anaerobic conditions (32), but none were fruitful (Fig. 1C and table S1). Interestingly, a P411 variant obtained in a previous cyclopropanation study, P411-S1 I263W (see supplementary materials for sources, sequences, and mutations), afforded a furan product (3b) with a total turnover number (TTN) of 210. Because other furan analogs have been identified as adducts of carbenes and alkynes (33), we were curious as to how furan 3b was generated. Preliminary kinetic study of the enzymatic reaction suggested that the enzyme first synthesized an unstable cyclopropene (3a), which subsequently rearranged to the furan either spontaneously or with assistance from the enzyme (Fig. 1B and fig. S5). This result provided strong evidence that the P411 hemeprotein is capable of transferring a carbene to an alkyne, which is, to our knowledge, an activity not previously reported for any protein or even any iron complex.

Fig. 1 Hemeprotein-catalyzed bicyclobutane formation.

(A) Overall reaction of carbene transfer to an alkyne catalyzed by an engineered hemeprotein (Et, ethyl; Ph, phenyl). (B) Proposed catalytic cycle of carbene transfer to phenylacetylene to form cyclopropene and bicyclobutane structures. (C) Screening of hemin and hemeprotein catalysts for bicyclobutane formation (BSA, bovine serum albumin; WT, wild type; TDE, V75T M100D M103E; H*, C400H). See supplementary materials for sources, sequences, and mutations in Bacillus megaterium P411-S1 and other proteins. (D) X-ray crystal structure of P411-E10 (PDB ID: 5UCW) (29) and view of its distal heme region. The P411 heme axial ligand is S400; amino acid residues Val78, Leu263, and Ser438 are shown as sticks. (E) Directed evolution of P411-E10 for bicyclobutane formation [using phenylacetylene and EDA as substrates; numbers refer to total turnovers to product (TTN) measured]. Experiments were performed at analytical scale using suspensions of E. coli expressing P411-E10 variants [optical density at 600 nm (OD600) = 10 to 30], 10 mM phenylacetylene, 10 mM EDA, 5 vol % EtOH, and M9-N buffer (pH 7.4) at room temperature under anaerobic conditions for 6 hours. Reactions were performed in quadruplicate. TTN refers to the total desired product, as quantified by gas chromatography (GC), divided by total hemeprotein. (Note: Because bicyclobutane formation requires two carbene transfers, the number of carbene transfers the hemeprotein catalyzes is 2 × TTN in these reactions.) See supplementary materials for further details of reaction conditions and data analysis. Single-letter amino acid abbreviations (here or in Fig. 3): A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; L, Leu; M, Met; P, Pro; S, Ser; T, Thr; V, Val; W, Trp; Y, Tyr.

To divert the enzymatic reaction to bicyclobutane formation, the enzyme would have to transfer a second carbene to cyclopropene intermediate 3a before the cyclopropene rearranges to the undesired furan product (Fig. 1B). We thus tested P411 variants closely related to P411-S1 I263W. We reasoned that amino acid residue 263, which resides in the distal pocket above the heme cofactor, might modulate the rate of this step, and that the bulky tryptophan (Trp) side chain at this site may be blocking the second carbene transfer. A P411-S1 variant with phenylalanine (Phe) instead of Trp at this position (263F) in fact catalyzed bicyclobutane formation at a very low level (<5 TTN) (table S1). Variant P4 with three additional mutations relative to P411-S1 I263F (V87A, A268G, and A328V) (28) synthesized the desired bicyclobutane 2a with 80 TTN and with the formation of furan adduct substantially suppressed (2a:3b > 50:1; Fig. 1C). Another related P411 variant, E10 (= P4 A78V A82L F263L), which was engineered from P4 for nitrene transfer reactions (29), catalyzed the desired transformation with a factor of >6 higher activity (530 TTN, Fig. 1E). Nuclear magnetic resonance (NMR) analysis revealed an exo, endo-configuration of the enzymatically produced bicyclobutane 2a, which is distinct from the only reported achiral endo, endo-isomer, made using an osmium-porphyrin complex (34, 35). We chose this P411-E10 variant as the starting template for directed evolution of an even more efficient bicyclobutane-constructing enzyme.

Because the side chain of residue 263 influenced formation of the bicyclobutane product, we performed site-saturation mutagenesis (SSM) of variant E10 at position 263 and screened whole E. coli cells expressing the mutated proteins for improved production of bicyclobutane 2a. The enzyme having leucine at this position (263L) was the most active; other amino acid residues either lowered the reactivity toward bicyclobutane formation and/or delivered more furan product. In parallel, two additional residues in E10, Val78 and Ser438, were also targeted by SSM. Aromatic residues were found to be activating at position 78, with a phenylalanine or tyrosine mutation giving a factor of 1.5 to 2 improvement over E10. This beneficial effect may stem from a π-π stacking interaction between the side chain and the alkyne substrate or the cyclopropene intermediate. A single S438A mutation on a loop residing above the heme also increased the activity, giving a factor of >2.5 increase in turnover. Finally, recombination of V78F/Y and S438A mutations led to the discovery of even more powerful biocatalysts for bicyclobutane formation (e.g., 1880 TTN with E10 V78F S438A) (Fig. 1E and fig. S9).

With the evolved E10 V78F S438A variant in hand, we next assayed the bacterial catalyst against a panel of aromatic alkyne coupling partners. Biotransformations with 10 different substrates were performed on a scale of 0.1 to 0.2 mmol. These preparative-scale reactions proceeded smoothly to furnish the corresponding bicyclobutanes with up to 1760 TTN and 80% yield (Fig. 2A). Additionally, three alkynes, 1k, 1l, and 1m, were transformed at mmol scale, and bicyclobutanes were isolated in hundred-milligram quantities, demonstrating that the biocatalytic transformation is readily scalable. Among the 13 different substrates, the engineered P411 hemeprotein did not exhibit strong preference toward specific electronic or steric features. Electron-deficient halides (2b2d), which can be used as prefunctionalities for further transformations, were accepted by the enzyme, as were electron-rich alkyl or alkoxyl groups (2e2h and 2k) at the meta or para position of the phenyl group. Even heterocyclic substrates such as thiophene (2j) served as suitable alkyne partners, albeit with lower reactivity.

Fig. 2 Scope of bicyclobutane formation and derivatization.

(A) Scope of P411-E10 V78F S438A–catalyzed bicyclobutane formation. Standard conditions of preparative-scale reactions (0.1- to 0.2-mmol scale unless otherwise indicated): suspension of E. coli (OD600 = 15 to 20) expressing P411-E10 V78F S438A, 1.0 equiv aromatic alkyne, 2.0 to 4.0 equiv EDA, 10 to 15 mM d-glucose, 1 to 5 vol % EtOH, and M9-N buffer (pH 7.4) at room temperature under anaerobic conditions for 12 hours. TTNs were determined on the basis of the isolated yields shown. (B) Derivatization of bicyclobutane products: (a) and (b), copper-catalyzed click cyclization of 2l with azide substrates (Ac, acetyl; CuTc, copper(I) thiophene-2-carboxylate; EDCI, 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide hydrochloride); (c), esterification of 2m with Mosher’s acid; (d), reduction of 2k to diol with LiBH4. See supplementary materials for further details of reaction conditions and data analysis.

Free functionalities, including alcohols (2i and 2m) and a second alkyne (2l), are well preserved, providing an additional opportunity for derivatization of these products. A terminal alkyne allows copper-catalyzed click chemistry, through which bicyclobutane 2l can be modified with a simple sulfonyl azide (4a) or even decorated with biologically relevant fragments, such as a phenylalanine derivative (4b). An unprotected hydroxyl group could also offer the possibility of linkage to useful structures. Additionally, to probe the enantiopurity of bicyclobutane products, we derivatized 2l and 2m with L-azido-phenylalanine and (R)-Mosher’s acid, respectively. The diastereomeric excess of these derivatized products would inform us of the enantiomeric ratio (e.r.) of the bicyclobutanes. In fact, we observed only one diastereomer of derivatized bicyclobutanes 4b and 4c by NMR. Furthermore, the dicarboxylic esters on the bicyclobutane structure can be reduced easily with a mild reducing reagent, LiBH4, to give diol product 4d with the strained ring structure preserved. The diol product 4d allowed for the unequivocal confirmation of the bicyclobutane structure and determination of the absolute configuration through x-ray crystallography.

We next asked whether the enzyme could stop at the cyclopropene product if less reactive aliphatic alkynes were used. To this end, we examined enzyme variants from the P411-S1 lineage for cyclopropene formation, using phenylbutyne (5a) and EDA as starting reagents. We were encouraged to see that P4 catalyzed the desired transformation with 260 TTN and 95.5:4.5 e.r. Further evolution was performed on P4 to improve its catalytic efficiency. We first targeted position 87, known for its importance to substrate recognition in P450-catalyzed oxidations (36). A87F (290 TTN, 3.0:97.0 e.r.) and A87W (240 TTN, 97.1:2.9 e.r.) were found to exert the opposite enantiopreference, suggesting that residue 87 also controls substrate orientation for non-native carbene chemistry. Single- and double-site-saturation mutagenesis conducted sequentially on P4 A87F and P4 A87W improved both reactivity and selectivity (Fig. 3A and figs. S11 and S13). The final K10 and C6 variants performed with higher activity by a factor of >10 relative to the initial P4 variant and with excellent stereocontrol (99.55:0.45 and 99.95:0.05 e.r., respectively).

Fig. 3 Engineering P411 enzymes for stereodivergent cyclopropene formation: Scope and derivatization.

(A) Evolutionary trajectory of P411-P4 variants for stereodivergent cyclopropenation of aliphatic alkynes. (B) Scope of P411-C6–catalyzed cyclopropene formation. Standard conditions of preparative-scale reactions (0.08- to 0.4-mmol scale): suspension of E. coli (OD600 = 10 to 32) expressing P411-C6 or K10, 1.0 equiv alkyne, 1.0 to 4.0 equiv EDA (6.0 equiv for 5m), 10 to 15 mM d-glucose, 1 to 5 vol % EtOH, and M9-N buffer (pH 7.4) at room temperature under anaerobic conditions for 12 hours (iPr, isopropyl). TTNs were determined on the basis of the isolated yields shown; e.r. values were determined by chiral high-performance liquid chromatography (HPLC). (C) Enzymatic cyclopropenation at mmol scale and derivatization of corresponding products: (a), copper-catalyzed addition to cyclopropene 6a for synthesizing a multisubstituted cyclopropane; (b), Diels-Alder reaction of cyclopropene 6h with 2,3-diMe-buta-1,3-diene to form a fused-ring system. See supplementary materials for further details of reaction conditions and data analysis.

To evaluate the substrate range of the evolved P411 variants for cyclopropene construction, we focused on P411-C6 and examined structurally diverse aliphatic alkynes. Enzymatic reactions with 12 alkynes at preparative scale (up to 5.0 mmol) afforded the desired cyclopropenes, with TTNs ranging from hundreds to thousands and good to excellent stereoselectivities (Fig. 3, B and C). Alkynes with a linear carbon chain (5b) or cyclic fragments (5g, 5h, and 5j) all served as good substrates. Different functional groups, including ether (5f, 5i, and 5l), ester (5d), acetal (5e), chloride (5k), and free hydroxyl (5m), were well tolerated. Further optimization of reaction conditions with slow addition of EDA, for example, would likely improve the isolated yields, as we demonstrated for cyclopropene 6h (66% yield, Fig. 3B; 94% yield, Fig. 3C).

Cyclopropenes are used as synthetic building blocks (4, 37), bio-orthogonal imaging precursors (38), and monomers in polymer synthesis (39). Our ability to construct these motifs using bacteria at scale allows us to further explore their potential utility in diverse fields. We carried out two simple transformations of cyclopropenes to build a multisubstituted cyclopropane 7a and a fused ring system, [4.1.0]heptene 7b (Fig. 3C), both of which are substructures common in pharmaceutical candidates and bioactive natural products (21).

Our results constitute a biocatalytic platform for the construction of highly strained bicyclobutanes and cyclopropenes through directed evolution of a serine-ligated cytochrome P450 (P411) enzyme. That the protein could be quickly adapted to produce these highly strained structures (three to six rounds of mutagenesis and screening) highlights the evolvability of the P411 scaffold and its potential to direct the construction of complex motifs. The protein enabled the desired transformations through activation of iron-carbenoid for carbene addition to alkynes, stabilization of the reactive cyclopropene intermediate (in bicyclobutane formation), and precise stereocontrol of the carbene transfer processes. Biotransformations with the evolved enzymes have a surprisingly broad substrate scope with high reactivity and selectivity, providing a route to more than 25 products in preparative scale. This biocatalytic system grants facile access to versatile molecular architectures rarely seen in nature, expanding the set of chemical structures available to biological systems.

Supplementary Materials

Materials and Methods

Figs. S1 to S15

Tables S1 to S9

References (4071)

References and Notes

Acknowledgments: We thank D. K. Romney, S. C. Hammer, and S.-Q. Zhang for helpful discussions and comments on the manuscript; C. K. Pier, O. F. Brandenberg, and A. M. Knight for sharing hemeprotein variants; K. Ding (D. J. Anderson Lab, Caltech) and J. Li (R. H. Grubbs Lab, Caltech) for generous donation of materials and reagents; S. C. Virgil and the Caltech Center for Catalysis and Chemical Synthesis, N. Torian and the Caltech Mass Spectrometry Laboratory, and M. K. Takase, L. M. Henling, and the Caltech X-ray Crystallography Facility for analytical support; and B. M. Stoltz for use of polarimeter and chiral gas chromatography equipment. Funding: Supported by NSF Division of Molecular and Cellular Biosciences grant MCB-1513007; Ruth L. Kirschstein NIH Postdoctoral Fellowship F32GM125231 (X.H.); and NSF Graduate Research Fellowship grant DGE-1144469 and the Donna and Benjamin M. Rosen Bioengineering Center (R.K.Z.). R.K.Z. is a trainee in the Caltech Biotechnology Leadership Program. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the funding organizations. Author contributions: conceptualization, K.C.; methodology, K.C.; validation, K.C. and X.H.; formal analysis, K.C., X.H., S.B.J.K., and R.K.Z.; writing (original draft), K.C. and F.H.A.; writing (review and editing), X.H., S.B.J.K., and R.K.Z.; funding acquisition, F.H.A.; supervision, F.H.A. Competing interests: K.C., X.H., and S.B.J.K. are inventors on patent application (CIT-7744-P) submitted by California Institute of Technology that covers biocatalytic synthesis of strained carbocycles. Data and materials availability: All data are available in the main text or the supplementary materials. Plasmids encoding the enzymes reported in this study are available for research purposes from F.H.A. under a material transfer agreement with the California Institute of Technology. Crystallographic coordinates and structure factors have been deposited with the Cambridge Crystallographic Data Centre ( under reference number 1815089 for compound 4d.
View Abstract

Stay Connected to Science

Navigate This Article