Research Article

Structural Basis of Transcription Initiation: An RNA Polymerase Holoenzyme-DNA Complex

See allHide authors and affiliations

Science  17 May 2002:
Vol. 296, Issue 5571, pp. 1285-1290
DOI: 10.1126/science.1069595


The crystal structure of Thermus aquaticus RNA polymerase holoenzyme (α2ββ′ωσA) complexed with a fork-junction promoter DNA fragment has been determined by fitting high-resolution x-ray structures of individual components into a 6.5-angstrom resolution map. The DNA lies across one face of the holoenzyme, completely outside the RNA polymerase active site channel. All sequence-specific contacts with core promoter elements are mediated by the σ subunit. A universally conserved tryptophan is ideally positioned to stack on the exposed face of the base pair at the upstream edge of the transcription bubble. Universally conserved basic residues of the σ subunit provide critical contacts with the DNA phosphate backbone and play a role in directing the melted DNA template strand into the RNA polymerase active site. The structure explains how holoenzyme recognizes promoters containing variably spaced –10 and –35 elements and provides the basis for models of the closed and open promoter complexes.

The structure of the bacterial RNA polymerase (RNAP) holoenzyme (1) reveals the architecture of the molecular machinery at the heart of gene expression. Nevertheless, many questions about transcription initiation remain: How does holoenzyme recognize variably spaced –10 and –35 hexamers of promoter DNA? How is the transcription bubble formed? Here, we report the crystal structure of Thermus aquaticus(Taq) RNAP holoenzyme (subunit composition α2ββ′ωσA) complexed with a promoter DNA fragment at 6.5 Å resolution, which begins to answer these questions.

The interaction of RNAP holoenzyme (R) with promoter DNA (P) initiates a series of structural transitions from the initial closed promoter complex (RPc) to the transcription-competent open complex (RPo). The double-stranded DNA is melted over a region spanning the transcription start site at +1, in the following process [reviewed in (2)]:Embedded ImageI represents at least one, and possibly more, intermediate states of the complex. RPo is in rapid equilibrium with I, and this equilibrium depends on many factors, including temperature, Mg2+ concentration, and promoter sequence (2). This presents challenges for structure determination, where large quantities of a homogeneous complex must be isolated. To overcome this, we used fork-junction DNA (3) as the template for complex preparation. Fork-junction DNA, which contains a double-stranded –35 element and a mostly single-stranded –10 element (Fig. 1A), forms stable complexes with RNAP holoenzyme (fig. S1A), independent of temperature. The base pair at –12 is required for optimal binding (3), which suggests that the fork-junction template mimics the structure of the upstream edge of the transcription bubble in RPo. The fork-junction sequence used here (4) also contains an extended –10 motif (5). The holoenzyme/fork-junction DNA complex (RF) mimics many properties of RPo, including the following: (i) RF, like RPo, is resistant to the inhibitor heparin (fig. S1A), (ii) mutations in both the promoter and the RNAP that are deleterious to open-complex formation cause parallel reductions in fork-junction binding affinity, and (iii) formation of RF is a multistep process, and some of the intermediates along the pathway share common properties with the intermediates in RPoformation (6). Moreover, we also crystallized a complex of holoenzyme with the same promoter fragment except that it was entirely double-stranded from –40 to –7. Difference Fourier analysis revealed no changes; the –12 position was base paired, and the added template strand from –11 to –7 lacked density because of disorder. Thus, the structure of the fork junction is an excellent model system to study the strucural properties of RPo.

Figure 1

Fork-junction DNA and electron density map. (A) Synthetic DNA oligonucleotides used for complex formation and crystallization. The numbers above denote the DNA position with respect to the transcription start site at +1. Downstream corresponds to the direction of RNAP movement during transcription. Mutations in the bottom DNA strand cause corresponding mutations in the RNA transcript, defining it as the template (versus the nontemplate) strand. The DNA sequence is derived from the full con promoter (4), with –35 and –10 elements (shaded yellow and labeled) as well as an extended –10 element (shaded red and labeled). (B) Stereo view of the Taq RNAP holoenzyme/fork-junction DNA complex. The α-carbon backbone of ω is colored white, β cyan, β′ pink, and σ orange (the α subunits are not visible). The DNA template strand is colored dark green, and the nontemplate strand is light green, except for the –35 and –10 elements, which are colored yellow. The visible structural domains of σ (σ2 and σ4) (1,9) are labeled. The direction of transcription (downstream) is to the right. The experimental electron density map, calculated using observed amplitude (F o) coefficients, is shown (blue net, contoured at 1.5σ), and was computed using multiple isomorphous replacement phases (Table 1), followed by density modification. The view is sliced at a level just in front of the DNA to reveal the β′ NH2-terminal Zn2+-binding domain and the associated Zn2+ (labeled, shown as a green sphere). Shown in red is a difference Fourier map, calculated using (∣F o EMTSF o native∣) coefficients (Table 1), revealing the Hg-binding site that was used to locate the Zn2+-site.

Crystallization and structure determination.

RF formed large single crystals, but diffraction from the crystals was highly anisotropic (4.5 × 6.5 Å). The structure was solved at 6.5 Å resolution by the method of multiple isomorphous replacement (Table 1). At this resolution, the major and minor grooves of the double-stranded DNA, as well as the 3′ single-stranded tail of the fork-junction nontemplate strand, were readily apparent in the electron density maps (Figs. 1B and S1B). Moreover, the protein/solvent boundary was clearly discernible, allowing the Taq core RNAP structure (7,8) to be placed manually into the density. This rough placement of the core RNAP, combined with a close examination of the map, revealed an essentially one-to-one correspondence between rodlike densities in the map and α-helices in the protein (Figs. 1B and S1B). The core RNAP structure was divided into five “domains” [table s1 of (1)] that were fit manually as rigid bodies into the density, mainly by aligning α-helices with the rodlike densities. Substantial conformational changes in the core RNAP structure were required to fit the density map (1). After rigid body refinement of the core RNAP domains, the three structural domains of σA (9), as well as the DNA model, were readily placed into the map. Even though individual base pairs were not visible in the map, the alignment of the DNA along the double-helical axis (with respect to the protein) was fixed based on symmetry arguments (10). A second round of rigid body refinement, in which the σ domains and DNA were individually positioned along with the original core RNAP domains, resulted in the model that was used for the molecular replacement search to solve the 4 Å resolutionTaq RNAP holoenzyme structure (1). Eventually, the final holoenzyme model derived at 4-Å resolution was combined with the DNA model, conformations of some domains were altered to fit the density map, and a final round of rigid body refinement was executed.

Table 1

Crystallographic analysis.

View this table:

Overall structure.

In RF, the fork-junction DNA lies across one holoenzyme face, completely outside the RNAP active site channel (Fig. 2). All of the sequence-specific RNAP contacts with the core promoter elements (–10, extended –10, and –35) are mediated by the σ subunit. The β′ NH2-terminal Zn2+-binding domain (β′ZBD) may contact the DNA backbone in the spacer between the extended –10 and –35 elements at –22 on the template strand and –27 on the nontemplate strand (Figs. 1B and 2).

Figure 2

Taq RNAP holoenzyme/fork-junction DNA structure. Views of the holoenzyme/fork-junction DNA complex. The RNAP holoenzyme is shown as a molecular surface, color coded as follows: αI, αII, ω, gray; β, cyan; β′, pink; and σ, orange. The molecular surface of σ is rendered partially transparent, revealing the α-carbon backbone worm (bright orange) inside. Protein surfaces in contact with the DNA (<4 Å) are colored green. These occur exclusively on σ. The DNA phosphate backbones are shown as worms, with the template strand (t) dark green, the nontemplate strand (nt) light green, except the –35 and –10 elements are yellow, and the extended –10 element is red. (A) Overall view of the complex. The view is similar to that of fig. 1A of (1), except that it is rotated about 45° conterclockwise around thez axis (perpendicular to the page) such that the DNA double-helical axis is roughly horizontal. The downstream and upstream directions are indicated. A numbering scale for the DNA position with respect to the transcription start site (at +1) is shown above the DNA. (B) Magnified view showing only a part of the complex, similar to the view of (A), with the downstream direction to the right. A numbering scale for the DNA position with respect to the transcription start site (at +1) is shown above the DNA. A pink line denotes the direction of the DNA double-helical axis (38), with a small kink at about –25, and a sharp bend at about –16. Obscuring parts of the β subunit in front have been removed (the outline of β is shown as a cyan line), revealing the structural features inside the main RNAP channel. The β′ZBD, β′ lid, and β′ rudder are labeled. The active site Mg2+ is shown as a magenta sphere.

The DNA is contacted by RNAP from only one side (Fig. 2), which explains footprinting data (11–14). The B-form DNA is relatively straight from –41 to –26 and –24 to –17. Centered at –25, the DNA bends 8° toward the major groove facing the RNAP β′ZBD (Fig. 2B). At –16, the DNA makes a sharp 37° bend toward the RNAP.

Conformational changes.

A comparison of holoenzyme within RF to holoenzyme alone (1) reveals conformational changes in two mobile modules (Fig. 3) [table s1 of (1)]. In RF, the RNAP clamp domain (magenta, in Fig. 3), along with σ2bound to it, rotates in toward the RNAP channel by 3°, closing the RNAP channel even further by about 3 Å relative to the holoenzyme alone. The β flap (blue, in Fig. 3), along with σ4bound to it, rotates 4°, resulting in the movement of the σ4 –35 element recognition helix (9) by about 6 Å in the downstream direction (Fig. 3).

Figure 3

Conformational changes. The superimposed α-carbon backbones of the Taq RNAP holoenzyme alone (1) and the holoenzyme within the fork-junction DNA complex are shown as worms (view the same as Fig. 2A). The structure of holoenzyme alone is colored gray (core RNAP) and black (σ). The two modules that move in the holoenzyme-DNA complex as compared with the holoenzyme alone are colored as follows: clamp + σ2, magenta and orange (respectively); β flap + σ4, blue and orange (respectively). The phosphate backbones of the DNA in the holoenzyme/DNA complex are shown as ribbons and colored green (template strand, t) and light green (nontemplate strand, nt). The downstream direction is indicated. The movements of the mobile modules from the holoenzyme structure to their positions in the holoenzyme-DNA complex are indicated by the arrows.

Holoenzyme-promoter interactions.

At 6.5 Å resolution, protein-DNA contacts were analyzed in terms of the vicinity of amino acid side chains to the DNA and whether alterations of rotamers could bring the side chains near the DNA. Within σ conserved regions 2.2 to 3.0, four distinct determinants have been implicated in direct interactions with promoter DNA. In region 3.0, two residues (His278 and Glu281 of TaqσA, corresponding to His455 and Glu458 of Escherichia coli σ70) are involved in recognition of the extended –10 element (15). Glu281 was critical for recognition of the extended –10 sequence, whereas His278 appeared to play a nonspecific DNA binding role. Both residues (red, in Fig. 4) are exposed on the surface of the σ region 3.0 α-helix, facing the major groove of the extended –10 DNA. Glu281 may be within reach of the nontemplate strand T at –13. His278 appears to be too far away from the extended –10 bases but could interact with the phosphate backbone of the nontemplate strand at positions –17/–18.

Figure 4

Holoenzyme contacts with downstream promoter elements. At the lower right, a view of the holoenzyme/fork-junction DNA complex is shown [view the same as Fig. 1A of (1)]. The boxed area denotes the portion magnified in the main part of the image. The holoenzyme is shown as a molecular surface (color coding: αI, αII, ω, gray; β, cyan; β′, pink; and σ, orange). The template DNA (t) is green, the nontemplate DNA (nt) is light green, except for the –35 and –10 elements, which are yellow, and the extended –10 element is red. In the single-stranded part of the nontemplate strand (–11 to –7), the DNA bases have been removed because their positions were not determined at the resolution of this analysis. Residues of σ, determined from genetic studies to be important for downstream promoter binding, are labeled and color coded as follows: extended –10 element recognition, red (15); –10 element recognition, green (16); –10 element melting/nontemplate strand binding, yellow (17–19); and universally conserved basic residues important for DNA binding, blue (20). The downstream direction is toward the lower right.

Region 2.4 of σ contains allele-specific suppressors of promoter mutations in the –10 element, implicating these residues in base-specific interactions with the –10 element [reviewed in (16)]. In group 1 σs, Gln260 and Asn263 (corresponding to Gln437 and Thr440 in E. coli σ70) suppress promoter mutations at –12 within the –10 element. In the structure, both residues are exposed, facing the major groove of the DNA near –12. Gln260 is easily within reach of the –12 bases and could interact with the nontemplate strand T or the template strand A (Fig. 4). An amino acid at position 263 (whether Asn or Thr), is out of reach of the –12 bases, but small adjustments of the structure might bring it within reach of the nontemplate strand T.

In σ region 2.3, highly conserved aromatic residues (Phe248, Tyr253, and Trp256, corresponding to Tyr425, Tyr430, and Trp433 of E. coli σ70) play a role in promoter melting, at least partly through sequence-specific binding of the nontemplate strand of the melted –10 element in the open complex (17–19). These residues appear ideally positioned to interact with unpaired bases of the single-stranded tail of the nontemplate strand DNA, which crosses the surface-exposed aromatic residues (Fig. 4). Phe248 is closest to bases at the –8/–9 positions, whereas Tyr253is closest to bases at –9/–10 (Fig. 4). Most intriguing is Trp256, which is positioned to stack on the exposed face of the –12 base pair, forming the upstream edge of the transcription bubble (Fig. 4). Trp256 may also be able to interact with the exposed base at the –11 position.

Universally conserved basic residues in regions 2.2 and 2.3 (Arg237 and Lys241, corresponding to Arg414 and Lys418 of E.coli σ70) are critical for DNA binding, probably in a non–sequence specific manner (20). Both positively charged residues are positioned to interact with the negatively charged DNA backbone of the nontemplate strand at the –13/–14 positions (Arg237) or at –15 (Lys241) (Fig. 4).

The 2.4 Å resolution structure of TaqσA 4 complexed with –35 element DNA (9) confirmed earlier genetic studies (16) indicating that residues of the recognition helix in the σ4 helix-turn-helix motif bind the –35 element sequence. In that structure, the DNA is bent 36° around the recognition helix, consistent with footprinting data (21). In contrast, in the RF complex, the DNA from –41 to –26, which includes the –35 element, is straight (Fig. 2B). In addition, the σ4 recognition helix is shifted upstream about 6 Å, so that the sequence-specific interactions with the –35 element (9) could not occur. There are two possible explanations for this discrepancy between the high-resolution σ4-DNA structure and the RF structure: (i) The crystals of the RF complex may have captured a state, likely late in the series of steps involved in open-complex formation, in which σ4 interactions with the –35 element DNA are no longer sequence specific. However, the available evidence from footprinting and crosslinking studies indicates that the specific σ4/–35 element interactions are the first RNAP/promoter interactions to be established and that these persist throughout the process of RPo formation (12, 14, 22–24), making this explanation seem unlikely. (ii) Our preferred explanation is that the σ4/DNA interactions observed in the RF complex are distorted by crystal packing. The upstream end of the fork-junction DNA packs against the upstream end of a crystallographically related DNA molecule, forming a pseudo-continuous double helix (10). This may not be compatible with a bend in the DNA near the upstream end. We suggest that crystal packing forces dictate that the upstream DNA be straight, and these forces are of sufficient strength to distort the sequence-specific σ4/–35 element DNA interactions.

Model of the complete open complex.

The RF structure reveals the disposition of the double-stranded promoter DNA from –41 to –12 and that of the nontemplate strand from –11 to –7 (Fig. 1A). However, the –35 element DNA and upstream may be distorted by crystal packing, and additional upstream and downstream DNA that interacts with holoenzyme is absent. Therefore, we constructed a model of RPo (Fig. 5), containing RNAP holoenzyme from the RF structure, as well as both strands of the DNA from –60 to +25.

Figure 5

Models of RPc and RPo. Views of holoenzyme/promoter DNA complexes along the pathway of open complex formation, modeled as described (see online material). The viewing angle is obtained from Fig. 2A by a 65° rotation around the horizontal axis. Double-stranded DNA is shown as atoms, and single-stranded DNA is shown as phosphate backbone worms with only the phosphate atoms visible. The template strand (t) is green, the nontemplate (nt) is light green, except for the –35 and –10 elements, which are yellow; and the UP elements, extended –10 element, and transcription start site on the template strand (+1) are red. The direction of transcription (downstream) is to the right. RNAP holoenzyme is shown as a molecular surface, color coded as follows: αI, αII, ω, gray; β, cyan; β′, pink; and σ, orange. The possible disposition of the αCTDs (drawn as gray spheres, labeled “I” and “II”) on the UP elements (29) is illustrated. (A) Models of RPc(left) and the final RPo (right). The arrows in between denote that several intermediate steps exist along the pathway between these two states (2). The β subunit is rendered partially transparent to reveal the RNAP active site Mg2+ (magenta sphere) inside the main channel and the transcription bubble and downstream DNA enclosed inside the channel in RPo. In RPc, a numbering scale for the DNA position (–60 to +25) with respect to the transcription start site (+1) is shown above the DNA. The labeled domains of β (β1 and β2) are defined in table s1 of (1). In RPo, RNA occupying the i and i+1 sites is shown as orange atoms. Sites of DNase I hypersensitivity in footprints of open complexes are denoted by open arrows in exposed minor grooves at approximately –45, –35, and –25 (21). The direction of view for Fig. 6 is also denoted. (B) Magnified view of RPo, showing the details of the core promoter interactions, transcription bubble, and downstream DNA. Obscuring portions of the β subunit in front have been removed (the outline of β is shown as a cyan line) to reveal the structural elements inside the main RNAP channel. The numbering of downstream DNA positions (with respect to the transcription start site at +1) is shown. RNA occupying the i and i+1 sites is shown as orange atoms. The molecular surfaces of the entire σ subunit, as well as of the β′ lid and β′ rudder are rendered transparent, revealing the α-carbon backbone worms (bright orange and pink, respectively) inside. A 9-residue disordered segment of σ (residues 337 to 345) is shown as a dotted orange line (1). The template strand DNA within the transcription bubble is directed through a protein tunnel framed by σ2 and the σ34 linker underneath, an α-helix of σ3 and the β′ lid on one side, σ2 and the β′ rudder on the other side, and a domain of β (β1) in front, closest to the viewer, but seen only in outline (Fig. 6).

The RF structure, combined with footprinting data (12, 14, 23, 24), also suggests the nature of RPc. In RPc, where there is no detectable strand separation of the DNA, the holoenzyme protects promoter DNA from DNase I and hydroxyl radical cleavage with an approximately 10 base-pair periodic pattern from about –54 to –6. However, there is no protection downstream of −6 by RPc, in contrast to RPo, which protects both strands of the downstream DNA to about +20 (consistent with the enclosure of this DNA within the RNAP main channel) (Fig. 5). The RPcfootprinting data indicate that the RNAP engages the promoter DNA upstream of the –10 element from one face of the DNA in a very similar manner as in RPo. However, in RPc, the DNA downstream of the –10 element is exposed, indicating that it is not engaged with the RNAP. The simplest interpretation is that, in RPc, the downstream DNA extends as a double helix straight past RNAP (Fig. 5A).

In addition to the −10, extended −10, and −35 elements, some promoters contain an UP element (an upstream, A/T-rich sequence, roughly –40 to –60) that is bound by the α-subunit C-terminal domains (αCTDs) in the DNA minor groove [reviewed in (25)]. The αCTD is an 80-residue, globular domain (26) connected to the α NH2-terminal domain by a flexible, 14-residue linker (27). In addition to stimulating transcription up to several hundredfold by binding to the UP element, the αCTDs serve as targets for a wide array of transcription activators [reviewed in (28)]. In the bacterial RNAP structures [core RNAP (7, 8), holoenzyme (1), and the RF structure], the αCTDs and linkers are disordered, but their possible disposition is schematically illustrated in Fig. 5, along with the sites bound by the αCTDs in an UP element–containing promoter (29).

In the RPo model, the upstream DNA bends around the RNAP, primarily due to kinks at about –35 (36°) and –25 (8°). At –16, the DNA makes another sharp 37° bend toward the RNAP. The two DNA strands separate and take different paths, beginning at –11, the upstream edge of the transcription bubble, which extends 15 nucleotides downstream.

The single-stranded, nontemplate DNA of the –10 element (–11 to –7) crosses σ2, where it can interact with the exposed aromatic residues of σ region 2.3 (Fig. 4). The strand from –2 to +4 is held in a groove between two lobes of β (β1 and β2) (Fig. 5A).

The single-stranded template strand, which must enter the RNAP active site channel to base pair with the initiating nucleotide substrates bound in the i and i+1 sites (Fig. 5), is diverted through a tunnel, completely enclosed on all sides by protein (Fig. 6). The tunnel is formed by parts of σ2, σ3, β1, the β′ lid, and the β′ rudder (Figs. 5B and6). Universally conserved basic amino acids of σ regions 2.4 and 3.0 (Arg259, Lys285, Arg288, and Arg291, corresponding to Arg436, Lys462, Arg465, and Arg468 ofE. coli σ70) are exposed at the entrance of the tunnel, contributing to the electrostatic potential that may direct the template strand into the tunnel (Fig. 6). The path of the template strand then takes it past the σ2–σ3 linker and elements of the β subunit that make up the back wall of the RNAP active site channel, until it passes near the active site (Fig. 5B). The downstream, double-stranded DNA from +5 to roughly +12 is clamped inside another protein tunnel between the β and β′ subunits (Figs. 5B and 6), consistent with footprinting (11–14) as well as functional studies demonstrating the importance of this downstream double-stranded DNA to the stability of the complex (30).

Figure 6

Template strand tunnel. The RPomodel (with αCTDs omitted) is viewed along the direction denoted inFig. 5A, parallel with the template strand as it enters its protein tunnel. (A) Stereo view of the entire RPo model. The RNAP holoenzyme is shown as a molecular surface, color coded as follows: αI, αII, ω, gray; β, cyan; β′, pink; and σ, orange. The molecular surface of σ is rendered partially transparent, revealing the α-carbon backbone worm (bright orange) inside. The DNA phosphate backbones are shown as worms, with the template strand (t) dark green, the nontemplate strand (nt) light green, except for the –35 and –10 elements, which are colored yellow, and the UP elements and extended –10 element, which are red. (B) Magnified stereo view, centered on the template strand tunnel, with the DNA removed. The main protein elements framing the template strand tunnel are shown as α-carbon backbone worms without the associated molecular surface. These are the β1 domain (blue), the β′ lid and β′ rudder (red), and σ2 and σ3 (orange). The side chains of four universally conserved basic residues of σ that frame the entrance to the tunnel are also labeled and shown. At the far end of the tunnel lies the RNAP active site, denoted by the Mg2+ ion (magenta sphere).

A survey of DNase I footprinting data of binary complexes of E. coli RNAP holoenzyme with 33 different promoters revealed a nonrandom pattern of sites that became hypersensitive to nuclease cleavage after RNAP binding (21). The DNase I hypersensitive sites are [(nontemplate/template strands); (−45/−47), ([−33, −34]/−37), and (−25/[−25,−26]). In the model, these sites are centered at three successive exposed minor grooves at about –45, –35, and –25 (Fig. 5A). At –35 and –25, the DNA is kinked (about 36° and 8°, respectively) toward the major groove facing the RNAP, thereby widening the exposed minor groove. Crystal structures of DNase I bound to DNA reveals that the nuclease attacks by binding in the minor groove, widening it by about 3 Å, and bending the DNA about 20° toward the major groove, away from the enzyme (31). Thus, kinking of the DNA in RPo at –35 and –25 by the RNAP in this manner would facilitate DNase I attack, explaining the tendency for DNase I hypersensitivity at these sites. The hypersensitive site around –45 suggests that the DNA contains an additional kink induced by the binding of αCTDI that would further bend the DNA around the RNAP (Fig. 5), but this kink has not been modeled because of the lack of structural information on the αCTD/DNA complex.


The structure of the Taq RNAP holoenzyme/fork-junction DNA complex reveals the disposition of the upstream promoter DNA and the nature of interactions forming the upstream edge of the transcription bubble. All of the sequence-specific contacts with core promoter elements are mediated by conserved regions of the σ subunit. A Trp residue, universally conserved among group 1 σ's (32,33), is ideally positioned to stack on the exposed downstream face of the base pair at –12, forming the upstream edge of the transcription bubble. Universally conserved basic residues in σ regions 2 and 3 provide critical contacts with the DNA phosphate backbone and play a role in directing the melted template strand of the DNA through a tunnel into the RNAP active site. The structure also provides the basis for models of the initial closed and open complexes that are consistent with most of the available biochemical data. In these complexes, the RNAP introduces a series of discrete kinks in the upstream DNA, bending the DNA around the RNAP to increase the available binding interface.

The structure explains how holoenzyme recognizes promoters containing variably spaced –10 and –35 elements. First, plasticity that appears to be inherent in holoenzyme allows repositioning of the β flap and the bound σ4 with respect to the DNA by at least 6 Å (Fig. 3). Second, the RNAP can kink the intervening DNA to correctly position the –10 and –35 elements with respect to each other (Fig. 2B). A close examination of data available in the literature reveals a strong correlation between –10/–35 spacer length and DNase I hypersensitivity around –25/–26 (table S1). Promoters with a 16 base-pair spacer rarely show DNase I hypersensitivity in this region, whereas promoters with a spacer of 18 base pairs or more always show hypersensitivity.

In the holoenzyme of the fork-junction complex, the claws around the RNAP main channel are essentially completely closed. Elements of the protein on each claw, such as the β′ rudder and parts of the β1 domain, interact across the channel by poking through the middle of the transcription bubble, sealing the DNA strands apart (Fig. 5B). This arrangement, presumably maintained during elongation, dictates that the DNA must rotate through the RNAP structure (or vice versa) during translocation, like a screw inside a nut, perfectly tracking the helical pitch of the DNA double helix. This has been observed experimentally (34).

In the model of the complete open complex, the DNA template strand is enclosed in a protein tunnel framed by universally conserved basic amino acids (Fig. 6B). Because open complex formation occurs without breaking covalent bonds in the DNA, the RNAP claws must open at some point during the process of open complex formation to allow the template strand to slip into its channel. Subsequent closure of the claws would then establish the tunnel. This requirement for prior states (intermediates) during the steps of open complex formation with different conformations of the enzyme, combined with the good match between footprinting data and the complete open complex model, leads us to suggest that the complex represented in this holoenzyme/fork-junction structure closely resembles the final RPo.

The RF structure, and the models derived from it, raise key questions that are central to understanding transcription initiation. How is RPo generated from RPc (Fig. 5A)? How do transcription activators interact with the complex to enhance the rate of transcription initiation? The structures and models presented here provide a basis for designing more decisive experiments probing these questions and more.

  • * To whom correspondence should be addressed. E-mail: darst{at}


Stay Connected to Science

Navigate This Article