Real-Time Observation of Transcription Initiation and Elongation on an Endogenous Yeast Gene

See allHide authors and affiliations

Science  22 Apr 2011:
Vol. 332, Issue 6028, pp. 475-478
DOI: 10.1126/science.1202142


Cellular messenger RNA levels are achieved by the combinatorial complexity of factors controlling transcription, yet the small number of molecules involved in these pathways fluctuates stochastically. It has not yet been experimentally possible to observe the activity of single polymerases on an endogenous gene to elucidate how these events occur in vivo. Here, we describe a method of fluctuation analysis of fluorescently labeled RNA to measure dynamics of nascent RNA—including initiation, elongation, and termination—at an active yeast locus. We find no transcriptional memory between initiation events, and elongation speed can vary by threefold throughout the cell cycle. By measuring the abundance and intranuclear mobility of an upstream transcription factor, we observe that the gene firing rate is directly determined by trans-activating factor search times.

Transcription initiation occurs through a series of sequential steps initiated by the binding of gene-specific activators and coactivators and resulting in recruitment of the basal transcription machinery and RNA polymerase (1). Genome-wide chromatin immunoprecipitation (ChIP) experiments have provided detailed information about the occupancy of transcription factors on DNA, and live-cell photobleaching recovery studies on tandem gene repeats indicate that turnover of these factors is generally on the order of seconds (24). These approaches describe the upstream regulators of transcription in a cellular context. However, to gain a functional understanding of transcription networks, it is necessary to observe the output of transcription, namely RNA production. We have developed a single-molecule assay to observe the enzymatic activity of an RNA polymerase II (RNAPII) molecule directly on an active gene in a living cell. This methodology enables direct measurement of promoter clearance, elongation, and termination rates. In Saccharomyces cerevisiae, we find that elongation is deterministic and initiation is stochastic. Moreover, by comparing nascent RNA dynamics with the reaction-diffusion behavior of an upstream transcription factor, we determine that promoter firing rate is dictated directly by the search times in the nucleus required for the factor to find the gene.

To detect RNAPII activity in living cells, we used a green fluorescent protein (GFP)–labeling approach adapted from the MS2 technique (5, 6). A cassette coding for 24 binding sites for the PP7 bacteriophage coat protein was inserted into the 5′ untranslated region (5′UTR) of a target gene (7). A PP7-GFP fusion protein is constitutively co-expressed and binds the stem loops after they have been transcribed, resulting in a GFP-labeled transcript (Fig. 1A) (810). Thus, one can directly measure both the earliest steps in pre-mRNA synthesis and the total dwell time of RNA at the site of transcription. The cell cycle–regulated POL1 promoter (POL1 pro) and PP7 binding site cassette (24 PBS) were knocked in upstream of the endogenous GLT1 gene, which was then found to transcribe similarly to the endogenous POL1 gene (fig. S1). When monitored by using wide-field microscopy, nascent transcription sites (TSs) are detected as diffraction-limited fluorescent puncta in the nucleus, the boundary of which is determined by using a red nuclear pore marker (Fig. 1B). The POL1 promoter was active during late G1/S phase, but activity periods varied from cell to cell and even for different alleles within the same nucleus, as shown in a time series in two different cells (Fig. 1B and movie S1).

Fig. 1

Real-time measurement of fluorescent RNA reveals transcription kinetics. (A) Diagram of the POL1pro-GLT1 reporter gene, obtained by replacing the GLT1 promoter with a cassette containing a his5 marker, POL1 promoter, and 24 PBS. (B) Time-dependent activity of individual reporter genes (PP7-GFP, green; Nup49-tomato, red). At time t = 0 min, both cells show a TS near the periphery of the nucleus. At t = 2 min, an additional TS corresponding to the duplicated gene has turned on in the upper cell (white and blue arrows). At t = 22 min, the nascent bud is visible in both cells, and the TSs of the upper cell are present at opposite sides of the nucleus, but the lower cell TS has turned off. At t = 28 min, the bud continues to grow, and both TSs have turned off. All images are maximum projected z-stacks. Scale bar indicates 3 μm. (C and D) Diagram of RNAPII progression on the gene with corresponding intensity trace (green line). The gray line is the intensity of a cytosolic mRNA; the black line is a background intensity at an arbitrary position in the nucleus measured by using the fitting algorithm (movie S2).

To analyze transcription kinetics, we followed the TS in the nucleus over time. The position and integrated intensity of an individual TS was determined by using a tracking algorithm (11) to generate the spatial trajectory and intensity time trace of the site (Fig. 1, C and D). Three distinct phases in the time trace are evident: (i) the increase in fluorescence signal corresponding to polymerization through the PP7 cassette as PP7-GFP binds recently transcribed RNA stem loops, (ii) a stable intensity of fluorescence corresponding to transcription of the GLT1 gene, and (iii) a sharp drop in fluorescence that corresponds to termination and release of the nascent RNA, which then rapidly diffuses away from the site (movies S2 and S3).

For an actively transcribing gene, many such events will be superimposed, with RNAPIIs in various positions along the gene, necessitating an analysis approach that takes into account these nonequilibrium kinetic processes (Fig. 2, A and B, and movie S4). To extract kinetic parameters from these transcription time traces, we developed an analysis approach that takes advantage of the fluctuations inherent in the measurement (12, 13). An idealized fluorescence time trace for transcription of a single pre-mRNA is shown in Fig. 2C. This process can be described by an autocorrelation function G(τ), which is a discrete autocorrelation over all the transition probabilities for RNAPII (6). The variables are the total dwell time of a transcript (T), which includes elongation and termination, and the transcript initiation rate (c). For long genes, the solution takes the form G(τ)=(Tτ)cT2H(Tτ)(1)where τ is the autocorrelation delay and H denotes the Heaviside step function, which is unity when τ ≤ T and zero when τ > T (Fig. 2D). Therefore, by steady-state observations of TS fluctuations it is possible to measure both the transcript initiation rate and the total time a transcript spends at the TS.

Fig. 2

Direct measurement of transcription initiation and elongation of the POL1p-GLT1 gene with fluctuation analysis. (A) TS intensity trajectory (green) compared to background (black). (B) Diagram of RNAPII occupancy based on the intensity data from (A). At time t1, a single RNAPII is loaded on the gene; at t2, multiple RNAPIIs load in rapid succession before the first has finished polymerization; at t3, the first RNAPII has terminated; at t4, the TS is devoid of nascent RNA. (C) Diagram of a single transcript intensity trajectory. After promoter clearance, RNAPII proceeds through the 24 stem loops, resulting in a discrete fluorescence increase upon PP7-GFP binding. After this 1.5-kb cassette, RNAPII continues through the gene, and the fluorescence is level. Upon completion of the transcript, there is a decrease corresponding to the brightness of a single transcript. (D) Autocorrelation function for transcription. The analytical solution (black curve) is compared to a Monte Carlo simulation (red circles) and the approximation for a long gene (Eq. 1) (gray curve). (E) Autocorrelation for POL1p-GLT1. The autocorrelation (red circles) is fit with Eq. 1. n = 10 cells. Error bars indicate SEM.

The autocorrelation for the POL1p-GLT1 construct reflects deterministic elongation times and no correlation between individual transcripts (Fig. 2E). The autocorrelation shows a linear decay with a sharp transition at the time that corresponds to the dwell time of a transcript at the TS (Τ = 290 ± 30 s) (SEM). The processive nature of pre-mRNA synthesis, consisting of thousands of sequential enzymatic cycles, results in a central-limit process where the total dwell time is distributed around a nonzero mean. Alternatively, a comparable number of micropauses interspersed with polymerization cycles would also produce a similar autocorrelation (14) (fig. S2). In addition, the autocorrelation vanishes at times greater than the dwell time, indicating that there is no transcriptional memory between initiation events. Thus, although the POL1 promoter is only active during a certain stage of the cell cycle, initiation events during this active period are stochastic and uncorrelated and occur with a frequency c = 1.3 ± 0.72 min−1.

We then sought to separate the kinetic processes of initiation, elongation, and termination. By placing the PP7 cassette in either the 5′UTR or 3′UTR of a long gene such as MDN1 (15 kb), one can measure dwell times in which termination plays a lesser or greater role in comparison to elongation (Fig. 3, A to C) (15, 16). With the cassette in the 5′UTR, the dwell time is dominated by the downstream portion of the gene (PP7-MDN1, Fig. 3D, red curve, and movies S5 andS6), where the fluorescence is not changing, leading to the autocorrelation in Eq. 1. With the cassette in the 3′UTR (MDN1-PP7, Fig. 3D, blue curve, and movie S7), the autocorrelation reflects the transitions between N stem loops, which occur with frequency k: G(τ)=kc(23)1[N(N+1)]2ekτi=0N[(Ni)(Nj+1)(2N+i+1)(kτ)ii!](2)In this case, the total dwell time T equals N/k. By using this complementary approach to selectively emphasize early and late events in the lifetime of a nascent RNA, we can separate elongation from termination.

Fig. 3

The housekeeping gene MDN1 shows variable elongation rate. (A) Diagram of the MDN1 gene with the PP7 cassette inserted either in the 5′ or the 3′UTR of MDN1 at the endogenous locus. (B and C) Visualization of TS and cytosolic mRNA (PP7-GFP, green; Nup49-tomato, red) for the PP7-MDN1 and MDN1-PP7 genes, respectively. A TS is designated with a blue arrowhead; cytosolic transcripts are designated by white arrowheads. Scale bar, 3 μm. (D) Diagram of the fluorescence time traces for MDN1 with stem loops in the 5′ or 3′UTR, PP7-MDN1 and MDN1-PP7. (E and F) Intensity trajectories of single PP7-MDN1 and MDN1-PP7 sites. (G) PP7-MDN1 transcription kinetics in G1 cells. The PP7-MDN1 autocorrelation was fit with Eq. 1. n = 10 cells. (H) MDN1-PP7 transcription kinetics in G1 cells. The MDN1-PP7 autocorrelation was fit with Eq. 2. n = 10 cells. (I) PP7-MDN1 transcription kinetics in late S/G2/M. The PP7-MDN1 autocorrelation was fit with Eq. 1. n = 10 cells. Error bars indicate SEM. Cell cycle stage was determined by size, nuclear morphology, budding index, and presence of multiple alleles (fig. S8).

MDN1 transcription dwell times and initiation rates varied over the cell cycle. The MDN1 gene was monitored at various stages of the cell cycle, resulting in fluorescence intensity traces during G1 for PP7-MDN1 and MDN1-PP7 (Fig. 3, E and F). During G1, the mean dwell time of PP7-MDN1 transcripts is T = 770 ± 260 s, and the mean dwell time of MDN1-PP7 is T = 140 ± 30 s (Fig. 3, G and H). From these data, it is possible to directly compute the elongation velocity according to the equation v=(L5'L3')(T5'T3'), where L is the length of the PP7 cassette plus the downstream sequence and T is the dwell time as measured above. The velocity of RNAPII on MDN1 is v = 20 ± 8 bases per second, and the termination time is 70 ± 41 s. The initiation rates are c = 0.16 ± 0.07 min−1 for PP7-MDN1 and c = 0.24 ± 0.1 min−1 for MDN1-PP7. The agreement between initiation rates follows from the processivity of RNAPIIs, which clear the promoter and proceed to termination. In late S/G2 phase, PP7-MDN1 was observed to have a much shorter mean dwell time (T = 310 ± 42 s) and a faster mean initiation rate (c = 0.40 ± 0.18 min−1) (Fig. 3I and fig. S3), resulting in a lower bound estimate of the velocity of 46 ± 6.2 bases per second. This variable dwell time of single RNAPIIs on the MDN1 gene is obscured by population averages of the single-molecule data and can be misinterpreted as RNAPII pausing (figs. S4 and S5).

Because POL1p-GLT1 and MDN1 each exhibit uncorrelated initiation events, gene firing is likely determined by a single rate-limiting step. Such a step might be the binding of a trans-acting factor to DNA, so we used two-photon fluorescence correlation spectroscopy (2PE-FCS) to determine the abundance and reaction-diffusion properties of the transcription factor Mbp1p (17, 18), which binds the POL1 promoter in vivo (2) and is necessary and sufficient for correct cell cycle–regulated expression of POL1 (19). The 2PE-FCS autocorrelation of Mbp1-GFP under endogenous regulation is shown along with a fit to a model where Mbp1-GFP alternates between bound and unbound states (Fig. 4 and fig. S6) (20). The time in the unbound state corresponds to three-dimensional (3D) diffusion within the nucleus, and the time in the bound state corresponds to 1D diffusion along DNA while the factor is nonspecifically bound (21). GFP targeted to the nucleus with a nuclear localization sequence (NLS-GFP) (18) shows simple 3D diffusion. The number of Mbp1-GFP molecules in the nucleus is 350 ± 18.

Fig. 4

Mbp1p-GFP transactivator dynamics in the yeast nucleus. 2PE-FCS autocorrelation of Mbp1p-GFP (red circles) and NLS-GFP (green circles), n = 25 and 17 cells, respectively. Mbp1p-GFP autocorrelation was fit to a model of reaction diffusion on the basis of transitions between free diffusion (light-gray curve) and nonspecific binding to chromatin (dark gray curve). Concentration of Mbp1-GFP = 60 ± 16 nM; D = 0.6 ± 0.1 μm2/s; τ3D = 1.1 ± 0.2 s; τ1D = 0.8 ± 0.1 s. NLS-GFP was fit to a simple diffusion model with D = 2.7 ± 0.4 μm2/s. Error bars indicate SEM.

From these data, it is possible to directly compute the search time (ts), according to the Berg and von Hippel model of facilitated diffusion (22):ts=MN4D1Dτ1D(τ3D+τ1D)(3)where M is the number of base pairs in the nucleus (12 × 106), D1D (5.1 × 105 bp2/s) is the 1D diffusion coefficient measured in vitro (21), and N is the number of factors doing the search. The calculated search time is 52 ± 8 s, which compared to the measured time between initiation events of 46 ± 25 s (Fig. 2E). A modified description of facilitated diffusion that postulates that the search regions are determined by the nucleosome-free DNA results in a search time of 84 ± 11 s (fig. S7) (23). Both models of facilitated diffusion predict a search time that is in quantitative agreement with the direct measurement of transcription initiation frequency for the POL1 promoter. Thus, the initiation rate from this active state shows a rate-limiting step that is consistent with the search time for Mbp1p, suggesting that every encounter leads to a promoter clearance event.

The view of transcription in yeast that emerges from these data is consistent with a model where transcription activation is determined by recruitment of a rate-limiting factor, which then assembles the pre-initiation complex (fig. S7) (24, 25). Remarkably for the POL1 promoter, this recruitment process is highly efficient, but the Mbp1p-DNA complex is also dynamic, as is the basal transcription machinery (26), resulting in single uncorrelated transcription events instead of the bursts of transcription observed for other genes (8, 16, 27, 28). Moreover, transcription rates in yeast are consistent with a search process occurring in a functionally homogeneous nuclear compartment. The single-factor, single-target search time for Mbp1p in the yeast nucleus is ~5 hours. In contrast, the Lac repressor search time in bacteria is ~300 s (21). However, the effective search times are similar, because of the number of factors doing the search (for LacI, 20 molecules; for Mbp1, 350 molecules). It is unclear whether the biophysical aspects of the search process are modified in the metazoan nucleus, where many genes are not expressed and chromatin shows substantial compartmentalization. The live-cell approach described in this report, which uses fluctuation analysis to observe the activity of RNAPII and the reaction-diffusion behavior of transcription factors, will be critical in deciphering transcriptional control at single genes operating in their native context.

Supporting Online Material

Materials and Methods

Figs. S1 to S8


Movies S1 to S7

References and Notes

  1. Materials and methods are available as supporting material on Science Online.
  2. Acknowledgments: The authors thank the following for critical reading of the manuscript: D. Grünwald, T. Lionnet, T. Trček, J. Yao, X. Darzacq, R. Tjian, and M. Larson. D.R.L. thanks W. Zipfel, S. Hess, J. Atilgan, and M. Nicholas for discussions about fluctuation analysis. S. Silverman provided the fkh1fkh2 deletion strain. B. Slaughter provided the NLS-GFP strain and J. Vogel the Mbp-GFP strain. We thank E. Gruss Lipper for her gift founding the Gruss Lipper Biophotonics Center that provided the equipment used in this study. Supported by NIH National Institute of General Medical Sciences grants 57071 and 86217 to R.H.S. There is a patent (no. 6,586,240) associated with the methods for visualization of RNA in living cells as presented in this manuscript.
View Abstract

Stay Connected to Science

Navigate This Article