Research Article

Crystal Structures of Human Topoisomerase I in Covalent and Noncovalent Complexes with DNA

See allHide authors and affiliations

Science  06 Mar 1998:
Vol. 279, Issue 5356, pp. 1504-1513
DOI: 10.1126/science.279.5356.1504


Topoisomerases I promote the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and recombination. The crystal structures at 2.1 and 2.5 angstrom resolution of reconstituted human topoisomerase I comprising the core and carboxyl-terminal domains in covalent and noncovalent complexes with 22–base pair DNA duplexes reveal an enzyme that “clamps” around essentially B-form DNA. The core domain and the first eight residues of the carboxyl-terminal domain of the enzyme, including the active-site nucleophile tyrosine-723, share significant structural similarity with the bacteriophage family of DNA integrases. A binding mode for the anticancer drug camptothecin is proposed on the basis of chemical and biochemical information combined with these three-dimensional structures of topoisomerase I–DNA complexes.

Topoisomerases are ubiquitous enzymes that solve topological problems generated by key nuclear processes such as DNA replication, transcription, recombination, repair, chromatin assembly, and chromosome segregation (1). There are two types of topoisomerases. Type I enzymes are monomeric and transiently break one strand of duplex DNA, allowing for single-step changes in the linking number of circular DNAs (the number of times one strand of DNA crosses the other). Type II enzymes are dimeric and break both strands of a duplex to generate a gate through which another region of DNA can be passed, resulting in linking number changes in steps of two. Type I and type II enzymes are fundamentally different in both mechanism and cellular function (2-4). The medical importance of these enzymes is underscored by the fact that they are the specific targets of many promising anticancer drugs (5).

Eukaryotic type I topoisomerases differ significantly from their prokaryotic counterparts. Prokaryotic enzymes require magnesium and a single-stranded segment of DNA; additionally, they form a covalent intermediate with the 5′ end of the broken DNA strand and relax only negatively supercoiled DNA. Eukaryotic topoisomerase I enzymes, in contrast, do not require any metal cofactor, need no single-stranded stretch of DNA, form a covalent intermediate with the 3′ end of the broken strand, and are able to relax both positive and negative supercoils. All available evidence indicates that prokaryotic and eukaryotic type I enzymes share no sequence or structural similarity. Vaccinia virus topoisomerase (6), however, shares several mechanistic and sequence features with eukaryotic type I topoisomerases but is much smaller (36 kD) than all known cellular topoisomerases I (7), which range from 80 to 110 kD (8). So far, no structure of a eukaryotic topoisomerase I containing all elements essential for catalysis has been reported (9).

Human topoisomerase I (topo I) is a monomeric protein of 765 amino acids (10). Limited proteolysis studies of human topo I (11), and the crystallographic structure determination of an inactive fragment of the yeast enzyme (2), indicate that eukaryotic topo I consists of four major regions: the NH2-terminal, core, linker, and COOH-terminal domains. The ∼210-residue NH2-terminal region of human topo I is highly charged, contains very few hydrophobic amino acids, and is largely disordered (12). This domain contains several nuclear-targeting signals (13) and has been implicated in nucleolar localization through interactions with nucleolin (14). The COOH-terminal domain, comprising residues ∼713 to 765, contains the essential catalytic Tyr723, which forms a phosphoester bond with the 3′ phosphate at the site of cleavage of the scissile strand of the substrate DNA (15). Purified COOH-terminal domain can be combined with isolated core domain composed of residues ∼200 to ∼635 to reconstitute near full enzyme activity (16). Thus, residues ∼636 to 712, which form the so-called linker domain, contribute to but are not required for activity (16). Hence, human topo I is a multidomain enzyme that contains two highly conserved globular domains (the core and the COOH-terminal domain) that are crucial for catalytic activity, and two other regions (NH2-terminal and linker) that are not strictly required for its catalytic and relaxation functions.

The “reconstituted topo I” used in these studies was obtained by mixing the purified 58-kD core domain (consisting of residues 175 to 659, expressed in the baculovirus-insect cell system) and the 6.3 kD COOH-terminal domain (residues 713 to 765, expressed inEscherichia coli) to reconstitute a tightly associated 1:1 complex (16). The reconstituted enzyme exhibits the same specific activity as either the NH2terminally truncated enzyme (residues 175 to 765) or the full-length protein when assayed under physiological salt conditions (16, 17).

Although human topo I does not exhibit a high degree of sequence specificity for its target cleavage sites (18), a sequence from the ribosomal DNA of Tetrahymena thermophilus has been described by Westergaard and his co-workers that is a high-affinity binding and cleavage site for all eukaryotic type I enzymes (19). Here we describe high-resolution structures of two variants of the reconstituted enzyme, each in association with a slightly different 22–base pair (bp) version of theTetrahymena DNA sequence. The first is the structure of the Tyr723→Phe inactive mutant in a noncovalent complex at 2.5 Å resolution, and the second is the covalent complex at 2.1 Å resolution. Together with the structure of the 70-kD NH2terminally truncated form of human topo I in complex with DNA described in the accompanying report (20), these complexes provide detailed insights into the mode of DNA binding and DNA manipulation by this key human enzyme.

The structure of human topo I is of considerable biomedical importance because this enzyme is the sole target of the camptothecin (CPT) family of anticancer drugs (5, 21-23). Two analogs of CPT, topotecan and irinotecan, have been used successfully in the treatment of several human cancers (5, 24,25). CPT and its derivatives act by prolonging the lifetime of the nicked intermediate in the topo I reaction (22,26-28). The resulting stabilized covalent intermediates are presumed to form obstacles to the advancement of transcription and replication complexes that eventually lead to DNA damage and cell death (29, 30). On the basis of the spatial distribution of residues that when mutated render the enzyme resistant to CPT (31, 32) and on the allowed chemical modifications of the compound (5), we propose a model for the binding of the drug to enzyme-DNA complexes.

Structure determination. The structures of human topo I reported here and in the accompanying report (20) are the result of a considerable effort in which six different crystal forms were produced and over 1000 crystals examined in the x-ray beam. The reconstituted (33) structures in both covalent and noncovalent complex with a 22-bp DNA duplex were determined by multiple isomorphous replacement (MIR) with mercury, platinum, and 5-iodo-deoxyuridine derivatives (Table 1). In the structure of the noncovalent complex with DNA (crystal form 2), the protein was inactivated by replacement of Tyr723 with Phe. The covalent complex (crystal form 6) was generated by introducing a 5′-bridging phosphorothioate group at the DNA cleavage site, which effectively traps the enzyme covalently attached to the DNA via a phosphoester bond to the active-site tyrosine (34). Nonisomorphism, large mosaic spread, and poor crystal quality made the structure determination extremely difficult, and only the use of precise crystal growth and cryo-cooling protocols, synchrotron radiation, and phasing with the program SHARP (35) allowed the structures to be elucidated (Table 1). One crystal of the covalent complex (form 6) diffracted to 2.1 Å resolution, and the structure of this crystal form was solved by MIR. Repositioning of the form 6 model into the nearly isomorphous form 2 cell allowed the structure of the noncovalent complex to be determined to 2.5 Å resolution. In both structures, the protein was well defined for almost the entire length of the polypeptide chain, and the DNA oligonucleotide was clearly defined over its entire length.

Table 1

Crystallographic structure determination of reconstituted human topo I–DNA complexes. Native data for heavy-atom phasing were collected from a crystal containing eight 5-iodo-deoxyuridine nucleotides on the DNA oligo at positions +8, +9, +10, +11 on the scissile strand and +3, +4, +5, +6 on the intact strand (see Fig. 4, F and G, for oligonucleotide numbering scheme). Heavy-atom derivatives were as follows: Hg-1: 1 mM ethylmercury phosphate, 14 hours; DNA oligo with six 5-iodo-deoxyuridine substitutions, at positions −2, −1, +8, +9, +10, +11 on the scissile strand; Hg-2: 1 mM thimerosal, 24 hours; Iodo: DNA oligo with 15 5-iodo-deoxyuridine substitutions, at positions −2, −1, +8, +9, +10, +11 on the scissile strand and −9, −8, −7, −6, −4, +3, +4, +5, +6 on the intact strand; Pt: 0.1 mM PIP [di-μ-iodobis (ethylenediamine) diplatinum (II) nitrate], 28 hours; DNA oligo with 10 5-iodo-deoxyuridine substitutions, at positions –2, −1, +8, +9, +10, +11 on the scissile strand and +3, +4, +5, +6 on the intact strand. The mean figures of merit [〈|ΣP(α)ei aP(α)|〉, where α is the phase and P(α) is the phase probability distribution] before density modification were 0.57 and 0.60 for acentric and centric data, respectively. Data used for form 6 refinement were collected from a crystal containing 10 5-iodo-deoxyuridine nucleotides on the DNA oligo (Fig. 4G), and data used for form 2 refinement were collected from a crystal containing eight 5-BrdU substitutions (Fig. 4F). Crystallization: Crystals of the noncovalent and covalent reconstituted complexes with DNA were grown by vapor diffusion at 22°C from sitting drops that were prepared by mixing 3 μl of crystallant [100 mM tris-HCl (pH 7.7), 24% v/v PEG 400, 100 mM MgCl2, and 10 mM DTT], 3 μl of water, 1 μl of duplex oligo (0.1 mM in 6 mM NaCl), and 2 μl of protein [5 mg/ml in 10 mM tris-HCl (pH 7.5), 1 mM EDTA, and 5 mM DTT]. Crystals of ∼0.1 mm by 0.2 mm by 0.4 mm grew in 2 to 3 weeks at room temperature. Data collection and structure determination: Crystals were flash-frozen in either liquid nitrogen or a nitrogen gas stream at 100 K with a stabilization buffer {the crystallant without DTT [100 mM tris-HCl (pH 7.7), 24% v/v PEG 400, 100 mM MgCl2]} as the cryo-protectant. Data were collected at ∼100 K at several synchrotron sources, processed with DENZO, and reduced with SCALEPACK (47). Native and derivative data from form 6 (covalent complex) crystals were collected at Stanford Synchrotron Radiation Laboratory (SSRL), beamline 9-1, by a MAR image plate system. The x-ray wavelength was adjusted to 0.980 Å to obtain an optimal anomalous scattering signal for Hg derivatives. Data used for form 2 (noncovalent complex) refinement was the peak wavelength of a multiwavelength anomalous dispersion experiment collected at the beamline X4A (National Synchrotron Light Source, Brookhaven National Laboratory) with Fuji image plates. Both form 6 and form 2 crystals belonged to space group P21 with nearly identical cell constants (a = 72.0 Å, b = 66.6 Å, c = 71.8 Å, β = 98.3° for form 6;a = 71.8 Å, b = 66.3 Å, c = 71.8 Å, β = 98.4° for form 2) and one protein-DNA complex in the asymmetric unit. Despite isomorphous cell constants, crystal-to-crystal nonisomorphism within both form 2 and form 6 crystals was revealed by high R iso values (15 to 20%) between native data sets. Eventually, out of nearly 80 data sets collected, a form 6 data set from a crystal containing eight 5-iodo-deoxyuridine substitutions on the DNA oligo was chosen as the native, and four form 6 data sets were chosen as isomorphous derivatives (two Hg, one Pt, and one with only I substitutions). Heavy-atom derivatives were prepared by washing crystals three times in the stabilization buffer to remove DTT, followed by soaks in heavy-atom compounds dissolved in this buffer. Heavy-atom sites were identified by inspection of difference Pattersons. Initial heavy-atom refinement and phasing with anomalous signal were carried out at 3.2 Å resolution with MLPHARE (48), with phase improvement by DM (49). Resultant electron density maps revealed some structural features but were not clearly traceable. Heavy-atom refinement and phasing with SHARP (35) and density modification with SOLOMON (50) yielded a readily traceable electron density map at 3.2 Å resolution (Fig. 3). The improvement gained with SHARP may have been due to its refinement of nonisomorphism parameters. All 22 bp of the DNA, nearly the complete main chain of the protein, and more than half the protein side chains were built with this MIRAS map and O (51). The covalent protein-DNA attachment was also evident in this initial MIRAS map. The map was consistent with the position of the 26-kD yeast topo I fragment (2) in the form 6 cell found with AMORE (52). Structural refinement: The form 6 model was refined with X-PLOR (53) and simulated annealing (54). Seven percent of the data were set aside for free-Rcross-validation (55) before any structural refinement. The remainder of the protein was built and improved with SigmaA-weighted (56) 2|F obs| − |Fcal c| and |F ob s| − |F calc| maps iteratively with X-PLOR refinement. The high-resolution (2.1 Å) data set used for the refinement of the form 6 model was collected at SSRL beamline 9-1 and was obtained from a crystal containing 10 5-iodo-deoxyuridine substitutions (Fig. 4G). Occupancies of these iodine positions were refined alternately with individual B factors until convergence. The final form 6 model contains residues 215 to 627 and 720 to 765 of human topo I, 22 DNA bp, and 360 waters, with good geometry and no Ramachandran outliers (57). The form 2 structure was solved by rigid body adjustment of the form 6 model, followed by coordinate refinement with X-PLOR and model rebuilding with O. The data set used for the refinement of the form 2 model was obtained from a crystal containing eight 5-BrdU substitutions (Fig. 4F), and the occupancies of these bromine positions were also refined alternately with individual B factors until convergence. A bulk solvent correction (X-PLOR) was used in the final stages of the refinement of both the form 2 and form 6 models. The final form 2 model contains residues 215 to 635 and 715 to 765 of human topo I, a 22-bp DNA duplex, and 63 waters, with good geometry and no Ramachandran outliers (57). Figures were created with MOLSCRIPT (58), Raster3D (59), GRASP (60), and ALSCRIPT (61).

Phasing by MIR with anomalous scattering of the covalent complex (crystal form 6)

View this table:

The human topo I protein. Reconstituted human topo I is composed of four domains that temporarily clamp around the DNA. In its closed form it contains a central DNA-binding pore of 15 to 20 Å in diameter (Fig. 1, A and B). Subdomains I, II, and III constitute the 56-kD core domain of the molecule, which in turn associates closely with the COOH-terminal domain to form the complete reconstituted enzyme. Subdomain I consists of residues 215 to 232 and 320 to 433 and is made up of two α helices and nine β strands (Fig. 2A). Subdomain II runs from residue 233 to 319 and is composed of five α helices and two β strands. Subdomains I and II fold tightly together, burying ∼1000 Å2 of solvent-accessible surface area on both subdomains, and form the top half or “cap” of the enzyme (Fig. 1, A and B). These domains are further characterized by two long “nose-cone” helices (α5 and α6) that come together in a “V” at a point 25 Å away from the body of the molecule. Subdomains I and II are similar in structure to the 26-kD yeast topo I fragment (2), with a root-mean-square (rms) deviation of 1.7 Å for 212 Cα positions and 53% sequence identity.

Figure 1

The structure of reconstituted human topoisomerase I. (A) Viewed perpendicular to the pore. Core subdomains I, II, and III are shown in yellow, blue, and red, respectively, and the COOH-terminal domain is shown in green. The enzyme consists of a “cap” consisting of core subdomains I and II (with the extended nose-cone helices facing to the right), and the closely associated complex of core subdomain III and the COOH-terminal domain at the bottom. (B) A view of the enzyme down the pore, rotated 90° about the vertical axis relative to (A), and with the domains colored as indicated above. The central pore of the molecule varies from 15 to 20 Å in diameter and provides an extensive, highly positively charged binding region for the substrate DNA. The overall dimensions of the reconstituted enzyme are 70 Å by 60 Å by 60 Å. The cap of the molecule and core subdomain III contact one another at two lips (residues 367 to 369 of subdomain I and residues 497 to 499 of subdomain III), as shown on the right central part of this figure.

Figure 2

Structural similarities between human topo I and HP1 integrase. (A) The sequence and secondary structural elements of reconstituted human topo I are indicated in the standard coloring scheme of the domain architecture of the enzyme (Fig. 1A), and the structurally similar regions of HP1 integrase are shown in red with gray background. Catalytically relevant residues of human topo I are highlighted in cyan, and the positions of known CPT-resistant mutations in human, hamster, and yeast topoisomerases I are shown in gray. α-Helices 18 and 19 are not depicted because these correspond to the linker domain (20), which is not present in the reconstituted enzyme. (B) Stereoview of the superposition of core subdomain III (red) and the COOH-terminal domain (green) of human topo I and bacteriophage HP1 integrase (gray) (38). The active-site residues of each enzyme are shown, with the human topo I residues in cyan and the integrase residues in gray. Helices 8, 10, 15, and 17 of core subdomain III of topo I are also indicated. There is no structural equivalent in the integrase for the topo I COOH-terminal domain past the first eight residues, which contain the catalytic Tyr723. The Cα positions of the active-site residues Arg488 and Arg590 of topo I (20) superimpose within 0.6 and 1.9 Å, respectively, of the Cα positions of Arg207 and Arg283 in the integrase. His306 of the integrase superimposes within 3.3 Å on His632 of human topo I, but the putative catalytic His280 of the integrase superimposes on a noncatalytic residue of human topo I, Lys587. Abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr.

Subdomain III runs from residue 434 to 635 and is a complex arrangement of 10 α helices and 3 β strands. This subdomain contains all the active-site residues, with the exception of the catalytic Tyr723 (Fig. 2A). Subdomain III extends from the top half of the molecule downward by way of two long α helices (8 and 9) that most likely function as the hinge that opens and closes the enzyme around the DNA (Fig. 1B). Subdomains I and III interact via two short “lips” opposite from these long-hinge helices. This is the only point of contact between the top and bottom halves of the molecule on that side of the clamp, and involves three residues from each subdomain and one salt bridge. The core of the enzyme interacts closely with the COOH-terminal domain primarily by way of α helices 8, 16, and 17 of subdomain III (Fig. 1A). This largely hydrophobic interface buries ∼1300 Å2 of solvent-accessible surface on each domain and is made up of 11 hydrophobic residues, as well as five specific hydrogen bonds and three salt bridges. The compact COOH-terminal domain runs from residues 713 to 765, is composed of five short α helices, and contains the active-site Tyr723 (Fig. 2A).

The three subdomains of the core and the COOH-terminal domain form a 64.3-kD particle with overall dimensions of 70 Å by 60 Å by 60 Å. The close integration between the four domains that is observed in the structure even in the absence of the 77-residue linker domain is in agreement with the observed stability of the reconstituted enzyme (16). The central pore of the molecule is composed largely of positively charged residues, 15 lysines and 8 arginines, which give rise to a high positive electrostatic potential surrounding the pore. The catalytic residues of the enzyme, including Tyr723, are contained within this highly positively charged channel region.

The structures of the reconstituted human topo I proteins in our two complexes are nearly identical, with an rms deviation of 0.45 Å on Cα positions. The only significant differences between the structures occur in the active-site region (as discussed below); in addition, residues 628 to 635 of subdomain III and 715 to 719 of the COOH-terminal domain are ordered in the noncovalent complex but are disordered in the covalent complex.

A search for structural similarity with other proteins with the program DALI (36) indicated that subdomain II of topo I was similar in structure to the homeodomain family of DNA-binding proteins. For example, residues 244 to 314 of human topo I superimpose on the POU homeodomain of the human Oct-1 transcription factor (37) with an rms deviation of 3.0 Å over 55 Cα positions, but show only 11% sequence identity. However, subdomain II of human topo I does not interact with the substrate DNA in either of the protein-DNA complexes we have solved, so any possible DNA binding function within this region of the enzyme still awaits further investigation.

A second structural similarity was identified by DALI between subdomain III of human topo I and the bacteriophage HP1 integrase (38). Residues 440 to 614 of topo I superimpose on residues 168 to 330 of integrase with an rms deviation of 3.8 Å for 129 Cα atoms with a sequence identity of 12% (Fig. 2, A and B). Although it was known that eukaryotic type I topoisomerases and several bacteriophage integrases share similar biochemical properties (39), a close structural similarity between the enzymes has not been described. Not only do 90% of the residues in subdomain III correspond with HP1 integrase, but also residues Gly717 to Leu724 of the COOH-terminal domain of topo I appear to correspond with residues 309 to 316 of integrase after subdomain III of topo I and integrase are superimposed (Fig. 2B). Most importantly, despite short insertions in each sequence, the active sites of the two enzymes line up remarkably well. The Cα position of the catalytic Tyr723 of human topo I falls within 3.9 Å of the corresponding atom in Tyr315 of the integrase after the two structures are superimposed. We observed that the absolutely conserved and catalytically important His632 of topo I corresponds to His306 of the integrase (20). The putative catalytic His280 of HP1 integrase superimposes on Lys587 of human topo I, which interacts with a phosphate away from the active site and is likely not a catalytic residue.

The DNA oligonucleotide structure. The 22-bp DNA oligonucleotides were fully visible in the electron density maps of both the covalent and the noncovalent complexes. In both crystal forms, the blunt-ended duplexes stack head-to-tail to form a pseudo-continuous helix along the crystallographic c axis (Fig.3). This stacking gives rise to the observed 72 Å length for the c axis in both crystal forms, which is shorter than the ∼75 Å expected for a 22-bp stretch of perfectly canonical B-DNA. Superpositions of the DNA duplexes with B-DNA reveal that the inner 10 bp (−4 to +6) deviate little from canonical B-form (with an rms deviation of 1.2 Å for 460 atoms), whereas the 6 bp at either end of the central region show more deviations from ideal B-DNA (rms deviation of 2.0 Å for 486 atoms). These regions of the oligonucleotide are more compacted and slightly underwound, probably giving rise to the shorter than canonical length along the crystallographic c axis. However, because these regions of the DNA oligonucleotide are not in contact with the protein (as described below), it seems likely that these observed deviations from ideal B-form DNA are crystal-packing effects rather than specific changes induced by the protein.

Figure 3

Experimental electron density map of symmetry-related DNA duplexes. Representative electron density from the experimental MIR with anomalous scattering map at 3.2 Å resolution is shown in purple (contoured at 1.2σ). The tail-to-head stacking arrangement of symmetry-related DNA oligonucleotides along the monoclinic c axis is evident (+8 to +12 region of DNA oligonucleotide downstream of the cleavage site in green; −10 to −6 region of symmetry-related DNA oligonucleotide upstream of the cleavage site in yellow). The entire DNA oligonucleotide, as well as the majority of the protein atoms, could be built into this experimental map.

Human topo I in complex with DNA. Human topo I engages in an intimate interaction with its DNA substrate, wrapping completely around the DNA and burying a total of ∼4700 Å2 of solvent-accessible surface area (∼2400 Å2 of the protein and ∼2300 Å2 of the DNA) upon complex formation (Fig.4, A and B). All three subdomains (I, II, and III) of the core plus the COOH-terminal domain interact with DNA, burying approximately 950, 80, 1050, and 330 Å2 of solvent-accessible surface area of each of these protein elements, respectively. Clearly, DNA interactions involving subdomain II and the COOH-terminal domain are quite limited but those involving subdomains I and III are very extensive. Subdomains I and III make up in large part the molecular clamp that closes around the DNA before cleavage and covalent attachment. In keeping with the nearly identical protein and DNA conformations in the two crystal forms, the protein-DNA contacts made in the noncovalent complex are essentially the same as those made in the covalent complex (Fig. 4, A, F, and G).

Figure 4

The structure of human topo I in complex with DNA. (A) The structures of the covalent (in the standard protein coloring scheme, Fig. 1A) and noncovalent (white) complexes of reconstituted human topo I with DNA superimposed and viewed as in Fig. 1A. The DNA oligonucleotide from the covalent complex is colored as follows: blue for the intact strand, and magenta and pink for the scissile strand upstream and downstream of the cleavage site, respectively. The protein completely wraps around the DNA oligonucleotide, contacting 4 bp upstream and 6 bp downstream of the cleavage site. The positively charged nose-cone helices on the cap of the enzyme are positioned above the major groove of the DNA, but do not directly contact the DNA. The position of the cleavage site is indicated. The rms deviation between all equivalent protein atoms of the two complexes is 0.67 Å, and the rms deviation between all equivalent DNA atoms is 0.44 Å. (B) The noncovalent human topo I–DNA complex viewed down the axis of the DNA (in approximately the same orientation as Fig. 1B). The protein is represented as a coil with the standard domain colors as defined above, and the DNA is rendered as a ladder in magenta. Residues at positions of known CPT-resistant mutations are shown as space-filling atoms in tan. The interaction of the lips of the molecule, which brings together core subdomains I (yellow) and III (red) and completes the wrapping of the enzyme around the DNA, is indicated on the right. (C) Stereoview of a region with extensive interactions between core subdomain I and the DNA oligonucleotide. A coil representation of residues 410 to 429 and the side chains of four residues that contact the DNA are shown in yellow. The interactions between the side chains and the DNA phosphate groups are indicated, as is the interaction between the main chain amide nitrogen of Thr411 (blue) and the phosphate between the −3 and −4 nucleotides of the scissile strand. The intact strand of the DNA is rendered in blue, and the regions of the scissile strand upstream and downstream of the cleavage site are rendered in magenta and pink, respectively. (D) Stereoview of protein-DNA interactions in the region of the lips of human topo I. Residues 361 to 369 of core subdomain I are shown in yellow, and regions of core subdomains III near residues 532 to 533 and 489 to 501 are shown in red. Direct contacts between the protein and the DNA are indicated, including five involving main chain amide nitrogens (blue). Direct contact is apparent in the minor groove between Lys532 and the Thy nucleotide base at the −1 position on the scissile strand. Residues that make up the lip interaction between subdomains I and III (His367, Pro368, Lys369 of subdomain I and Glu497, Thr498, and Ala499 of subdomain III) are also shown; the only direct contact between these regions of the molecule is the salt bridge between Lys369and Glu497. Residues at positions of known CPT-resistant mutations are indicated in tan. The DNA is rendered with the same color convention as in (C), and the position of the +1 bp is indicated. (E) Stereoview of protein-DNA contacts near the active-site residues in the noncovalent topo I DNA. The coloring scheme is identical to that of Fig. 4, C and D, with the addition of the active-site residues in cyan and the coil in green to represent the COOH-terminal domain. The tight network of protein-DNA phosphate interactions, mostly involving main chain amide nitrogens, is apparent just across the minor groove from the active site. These interactions are further stabilized by a helix dipole from the 20-residue α helix 16 to the phosphate group between the −4 and −5 nucleotides of the intact strand. (F) Schematic representation of the protein-DNA interactions in the noncovalent complex of human topo I with DNA. Interactions between protein side chain and main chain atoms with DNA phosphate groups and minor-groove atoms closer than 3.5 Å are indicated. The protein-DNA contacts are limited almost exclusively to protein-phosphate interactions, and the protein only contacts the central 10 bp around the cleavage site. The side chains of Lys436, His632, and Asn722 and the main chain amide nitrogens of Asn491, Gln633, and Ala635 make contacts closer than 3.5 Å in this complex but not in the covalent complex (Fig. 4G; note that residues 628 to 635 are disordered and are not present in the structure of the covalent complex). In addition, the side chain of Arg364 makes a different contact in this complex relative to the covalent complex (Fig. 4G). The sugar-phosphate backbone of the oligonucleotide is colored as described in (A). (G) Schematic representation of the protein-DNA interactions in the covalent complex of human topo I with DNA. The coloring and numbering scheme are the same as in (F). The side chains of His266, Lys720, and Asn745 make contacts closer than 3.5 Å in this complex but not in the noncovalent complex (Fig. 4F). The two schematics in (F) and (G) provide exact information about the positions of the modified and unmodified bases used in the two reconstituted complexes. None of the substituted iodine or bromine atoms is in contact with the protein.

The nose-cone helices of the “cap” of the enzyme offer a highly positively charged surface to the sugar-phosphate backbone, yet the DNA declines to interact directly with these positively charged residues in both the covalent and noncovalent complexes. Elsewhere, extensive protein-DNA contacts do occur, including residues 410 to 429 from subdomain I (Fig. 4C), which position β strand 11 into the major groove roughly opposite the cleavage site. The regions containing the lips between subdomains I and III are also in close contact with the DNA (Fig. 4, B and D). As expected, residues near the active site make several DNA contacts, especially directly across the minor groove from the cleavage site (Fig. 4E). Moreover, the dipole of helix 16 helps to stabilize the protein-DNA interactions by bringing its NH2-terminus close to the phosphate between the −4 and −5 nucleotides of the intact strand.

The protein contacts only the central 10 bp of the DNA (that is, positions −4 to +6), and the vast majority of these contacts involve the DNA phosphate groups (Fig. 4, F and G). The total number of protein-DNA contacts closer than 3.5 Å is 30 for the noncovalent complex and 27 for the covalent complex, and several DNA contacts are made by main chain amide nitrogen atoms as hydrogen bond donors (Fig.4, C to G). A total of 26 residues in the noncovalent complex and 24 residues in the covalent complex participate in these direct protein-DNA contacts. There is only one base-specific contact, a 2.6 to 2.9 Å hydrogen bond in the minor groove between Lys532 and the O-2 carbonyl oxygen of the −1 thymidine base on the scissile strand (Fig. 4D). This interaction is probably related to the preference observed for thymidines at the scissile −1 position in strong breaks sites (18), because thymidine is the only nucleotide base in which the two lone electron pairs of the O-2 atom are freely available to accept a hydrogen bond from the ɛ-amino group of Lys532. In general, topo I makes extensive protein-DNA contacts, similar to the way certain restriction enzymes wrap around their target DNA, but with only one base-specific contact and little evidence for sequence-specific DNA recognition.

The essential Tyr723. The electron density at the active site of the covalent reconstituted complex reveals that the catalytic Tyr723 is indeed covalently attached by a phosphodiester bond with the 3′ end of the scissile strand (Fig. 5A). This covalent attachment was evident even in the original maps with experimental phases at 3.2 Å resolution and was confirmed by capillary electrophoresis of washed and dissolved crystals (not shown). The free 5′-sulfhydryl of the cleaved strand had not carried out the religation reaction, in accordance with biochemical studies (34). The phosphate of the tyrosine-DNA phosphodiester bond makes close interactions with the guanidinium groups of Arg488 (at a distance of 2.8 Å) and Arg590 (at 2.6 Å). Both arginines are implicated in the reaction mechanism of the enzyme, as described in the accompanying report (20).

Figure 5

Covalent and noncovalent complexes. (A) Close-up of the 2.1 Å 2|F obs| − |F calc| electron density map of the covalent complex near the active site, contoured at 1.5σ, revealing the covalent 3′-phosphate-tyrosine bond between Tyr723 and the broken strand of the DNA. (B) Close-up of the 2.5 Å 2|F obs| − |F calc| electron density map of the noncovalent complex near the active site, contoured at 1.5σ, revealing the inactive Tyr723→Phe mutation and the intact DNA oligonucleotide.

The electron density at the active site in the noncovalent complex shows unequivocally that residue 723 is a phenylalanine in the mutant enzyme and that the DNA strand is intact (Fig. 5B). When the covalent and noncovalent complexes are superimposed, it is observed that (i) the phenyl rings of Tyr723 and Phe723are rotated ∼20° relative to one another and the Cα positions of these residues are shifted by ∼0.5 Å; (ii) the conserved residues Arg488 and Arg590 are displaced by rms deviations, respectively, of 0.45 and 0.27 Å when averaged over all atoms; (iii) the 5′ bridging oxygen of the +1 nucleotide of the scissile strand is shifted ∼0.8 Å with respect to the free 5′-sulfhydryl group; and (iv) the phosphorous atoms of the scissile phosphates have moved 1.6 Å with respect to one another. Thus, the differences between the covalent and noncovalent complexes of human topo I with DNA are confined mainly to the catalytic tyrosine and the scissile phosphate group. The residues around the active site, including the catalytic arginines 488 and 590, are shifted minimally, as are the deoxyribose rings upstream and downstream of the cleavage site. Hence, the formation of the covalent complex does not seem to require marked structural changes in the protein or the DNA.

Camptothecin binding. Camptothecin (Fig.6A) is a cytotoxic plant alkaloid that rapidly blocks both DNA and RNA synthesis in treated cells and has emerged as a potent anticancer compound (5, 21,22, 30). Topo I is the sole intracellular target for CPT and CPT derivatives (23), which act by stabilizing the covalent topo I–DNA complex. In in vitro studies, CPT and its analogs inhibit plasmid relaxation by human topo I and increase the yield of covalent intermediates when reactions are stopped with SDS (22, 26, 40). The observations reviewed below provide the rationale for a provisional placement of CPT into the structure of the covalent topo I–DNA complex shown in Fig.6B.

Figure 6

Proposed CPT binding mode. (A) A schematic representation of the key hydrogen bond and ring-stacking interactions made between the human topo I–DNA covalent complex and CPT in the proposed CPT binding mode. The atomic nomenclature for CPT is also indicated. (B) Stereoview of the proposed binding mode of CPT to the covalent human topo I DNA complex. The active lactone form of CPT (20-S-camptothecin, in green) is shown stacked between the terminal +1 guanine nucleotide from the cleaved strand (+1 Gua, in yellow, which is reoriented from the observed position as described below), and the side chain of Asn722, which provides interactions with the A-ring of CPT (the cleaved strand is rendered in light and dark magenta upstream and downstream of the cleavage site, respectively). The carbonyl oxygen at the 17 position in CPT makes a hydrogen bond with the NH2 group on the pyrimidine ring of the +1 cytosine. The side chains of active-site residues Tyr723, Arg488, and Arg590are shown in cyan. The side chain residues that, if singly mutated, result in a CPT-resistant phenotype [Phe361, Gly363, and Arg364 of region 1 (see text); Asp533 and Asn722 of region 2] are shown in tan. The side chain conformations of Arg364 and Asp533 have been altered slightly from the final structure of the covalent complex to allow for optimal hydrogen bonding to the double-bonded lactone oxygen and the hydroxyl at the 20-Schiral center of CPT, respectively. Modifications to the 10 and 11 positions of CPT may require some minor shifts in the positions of residues Lys720 and Leu721 of topo I, which exhibit relatively high temperature factors (for example, 55 to 65 Å2) in the structure of the covalent complex. The proposed conformation of the +1 Gua nucleotide was inspired by flipped-out bases observed experimentally by Sussman and co-workers (62), but was further optimized by rotations about bonds in the intact phosphate between the +1 and +2 nucleotides. Because this base is now a terminal nucleotide in the cleaved strand, it is less contrained by the ribose-phosphate backbone and is more free to rotate to positions outside the DNA duplex.

Numerous structure-activity studies have identified several design principles for CPT analogs and have also established a direct correlation between the ability of CPT derivatives to stabilize the covalent topo I–DNA intermediate and their ability to kill cancer cells (40, 41). The structural features of CPT that are essential for activity include the 20(S)-hydroxyl (42), the pyridone moiety of the D-ring, the lactone moiety of the E-ring (24), and the planarity of the five-membered ring system (Fig. 6A) (43). Hence, the C-D-E rings of CPT cannot be altered without severely affecting its ability to block religation by topo I. In contrast, modifications to the 9, 10, and 11 positions of the A-ring and the 7 position of the B-ring are generally well tolerated and in many cases enhance the potency of the CPT analog in both in vitro and in vivo studies (5). Furthermore, substantial evidence indicates that CPT binds reversibly only after cleavage and covalent attachment of the enzyme to the DNA (27).

CPT preferentially enhances DNA breakage at sites with a guanine base at the +1 position on the scissile strand, immediately downstream of the cleavage site (44). Moreover, studies by Pommier and co-workers have shown that chemically reactive analogs of CPT that contain a chloromethyl addition to the C-7 position can alkylate the N-3 nitrogen of the +1 guanine base after cleavage (45). It is inferred from these and other studies (32) that upon cleavage and covalent attachment, the +1 guanine can assume a conformation in which it is stacked against the five-membered CPT ring system such that the guanine N-3 is in close proximity to the C-7 of CPT (45).

A small number of CPT resistance mutants (CPTr) have been reported for the topo I enzymes from human, yeast, and hamster (31, 32) (Figs. 2A and 4, B, D, and E). Projecting all known CPT escape mutants from the various species onto the three-dimensional structure of human topo I suggests that there are four regions of the protein that can be mutated to produce a CPTr enzyme: region 1: residues Gly717 to Asn722, which lie in close proximity to the active-site Tyr723 (Fig. 4E); region 2: the interface between α helix 13 (residues Lys532 to Ser534) in core subdomain III, which contains the human CPTr mutant Asp533→Gly, and a loop segment (residues Phe361 to Met370) between β strands 6 and 7 of core subdomain I, which contains the human CPTrmutations at positions 361, 363, and 364 (Fig. 4D); region 3: Thr729, which lies deep within the hydrophobic core of the COOH-terminal domain (Fig. 4B); and region 4: a conserved hydrophobic segment of core subdomain III, near α helix 13, involving residue Gly503 (Fig. 2A). The amino acid residues at positions 503 and 729, which lie in regions 3 and 4, are far removed from the active site and from the DNA oligonucleotide and hence are not directly taken into account in our model for CPT binding. These residues may play a structural role in the proper packing of the COOH-terminal and core domains and may affect CPT efficacy by interfering with the positioning of catalytic or CPT-binding residues.

Combining our crystallographic information with these chemical and biochemical observations, we propose a hypothetical CPT binding mode (Fig. 6A). We chose to model CPT into the structure of the covalent complex because it is the relevant target of CPT binding. The locations of CPTr side chains suggest that the CPT binding site spans the DNA duplex from region 1 of the protein to region 2. An attractive feature of this binding mode is that carbon positions 7 and 9 of CPT, which are amenable to chemical modification, are facing out into open space. Thus, modifications to the C-7 and C-9 positions of CPT are allowed and could form favorable interactions with distant protein or DNA atoms that would explain the increased potency such derivatives have relative to unmodified CPT. Insertion of CPT between regions 1 and 2 requires the movement or displacement of the +1 bp immediately downstream of the cleavage site. This movement could be achieved by repositioning the guanine nucleotide at the +1 position of the scissile strand outside of the DNA duplex and allowing CPT to occupy the space vacated by the base, such that a reasonable hydrogen bonding network can be established between the Asp533 and Arg364 side chains and the immutable 20(S)-hydroxyl and lactone moieties of CPT (Fig. 6, A and B). Mutations of the Phe361 and Gly363 residues are likely to mediate CPT resistance by disrupting the conformation of the loop that holds Arg364.

In this binding mode, the terminal carbon (C-18) of the CPT ethyl group has been rotated slightly relative to its position in the CPT crystal structure (46) to remove steric clashes with the side chain of Asp533 and the −1 nucleotide base. A CPT molecule docked in this way would explain the necessity for an S configuration at the C-20 position. The inactive R-diastereoisomer of CPT would fail to bind to right-handed DNA in this manner because of major steric clashes with the +2 bp.

Support for the displacement of the +1 guanine base from its normal base-paired position comes from the alkylation studies, which indicate that the N-3 position of this guanine is in close proximity to the C-7 position of CPT in the covalent complex (45). This proximity has been achieved by the aforementioned repositioning of the terminal guanine of the broken strand, which allows the base to stack on top of the planar CPT (Fig. 6B). In this configuration, the A-ring of CPT forms an additional stacking interaction with the planar conjugated side chain of Asn722. The intact cytosine nucleotide at the +1 position in the noncleaved strand further stabilizes this binding mode by forming a hydrogen bond between its N-3 amino group and the pyridone oxygen of the D-ring of CPT. In the proposed topo I-DNA:CPT complex, the free 5′ hydroxyl group is over 4.5 Å removed from the phosphate group to be attacked for religation. Hence, the enzyme is trapped in its covalent complex with DNA.

  • * These authors contributed equally to this work.

  • Present address: Emerald BioStructures, 7865 Northeast Day Road West, Bainbridge Island, WA 98110, USA. E-mail: emerald_biostructures{at}

  • To whom correspondence should be addressed at Biomolecular Structure Center, School of Medicine, University of Washington, Seattle, WA 98195, USA. E-mail: hol{at}


View Abstract

Navigate This Article