28
Article Correlating Transcription Initiation and Conformational Changes by a Single-Subunit RNA Polymerase with Near Base-Pair Resolution Graphical Abstract Highlights d Single-molecule fluorescence monitors most steps during transcription initiation d Abortive initiation occurs by RNA polymerase recycling or exchange d Initiation to elongation transition shows multiple branching kinetic pathways d Majority of productive transcription shows no detectable abortive initiation Authors Hye Ran Koh, Rahul Roy, Maria Sorokina, Guo-Qing Tang, Divya Nandakumar, Smita S. Patel, Taekjip Ha Correspondence [email protected] (S.S.P.), [email protected] (T.H.) In Brief Koh et al. report single-molecule transcription that monitors most steps of initiation in real time with near base-pair resolution. They distinguish productive and failed transcription; they also show that a majority of productive transcription occurs without abortive initiation and that most of the abortive RNAs result from multiple rounds of failed transcription events. Koh et al., 2018, Molecular Cell 70, 695–706 May 17, 2018 ª 2018 Elsevier Inc. https://doi.org/10.1016/j.molcel.2018.04.018

Correlating Transcription Initiation and Conformational

  • Upload
    others

  • View
    13

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Correlating Transcription Initiation and Conformational

Article

Correlating Transcription

Initiation andConformational Changes by a Single-Subunit RNAPolymerase with Near Base-Pair Resolution

Graphical Abstract

Highlights

d Single-molecule fluorescence monitors most steps during

transcription initiation

d Abortive initiation occurs by RNA polymerase recycling or

exchange

d Initiation to elongation transition shows multiple branching

kinetic pathways

d Majority of productive transcription shows no detectable

abortive initiation

Koh et al., 2018, Molecular Cell 70, 695–706May 17, 2018 ª 2018 Elsevier Inc.https://doi.org/10.1016/j.molcel.2018.04.018

Authors

Hye Ran Koh, Rahul Roy,

Maria Sorokina, Guo-Qing Tang,

Divya Nandakumar, Smita S. Patel,

Taekjip Ha

[email protected] (S.S.P.),[email protected] (T.H.)

In Brief

Koh et al. report single-molecule

transcription that monitors most steps of

initiation in real time with near base-pair

resolution. They distinguish productive

and failed transcription; they also show

that a majority of productive transcription

occurs without abortive initiation and that

most of the abortive RNAs result from

multiple rounds of failed transcription

events.

Page 2: Correlating Transcription Initiation and Conformational

Molecular Cell

Article

Correlating Transcription Initiationand Conformational Changes by a Single-SubunitRNA Polymerase with Near Base-Pair ResolutionHye Ran Koh,1,2 Rahul Roy,1 Maria Sorokina,1 Guo-Qing Tang,3 Divya Nandakumar,3 Smita S. Patel,3,*and Taekjip Ha1,4,5,6,*1Department of Physics and Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA2Department of Chemistry, Chung-Ang University, Seoul 06974, Korea3Department of Biochemistry andMolecular Biology, Rutgers University, Robert Wood JohnsonMedical School, Piscataway, NJ 08854, USA4Howard Hughes Medical Institute, Baltimore, MD 21205, USA5Departments of Biophysics and Biophysical Chemistry, Biophysics, and Biomedical Engineering, Johns Hopkins University, MD 21205, USA6Lead Contact*Correspondence: [email protected] (S.S.P.), [email protected] (T.H.)

https://doi.org/10.1016/j.molcel.2018.04.018

SUMMARY

We provide a comprehensive analysis of transcrip-tion in real time by T7 RNA Polymerase (RNAP) us-ing single-molecule fluorescence resonance energytransfer by monitoring the entire life history of tran-scription initiation, including stepwise RNA synthe-sis with near base-pair resolution, abortive cycling,and transition into elongation. Kinetically branchingpathways were observed for abortive initiation withan RNAP either recycling on the same promoter orexchanging with another RNAP from solution. Wedetected fast and slow populations of RNAP in theirtransition into elongation, consistent with the effi-cient and delayed promoter release, respectively,observed in ensemble studies. Real-time moni-toring of abortive cycling using three-probeanalysis showed that the initiation events are sto-chastically branched into productive and failedtranscription. The abortive products are generatedprimarily from initiation events that fail to progressto elongation, and a majority of the productiveevents transit to elongation without making abor-tive products.

INTRODUCTION

The single-subunit T7 RNA polymerase (RNAP) catalyzes tran-

scription fundamentally in the same way as the multi-subunit

RNAPs, in spite of its relatively simple structure unrelated to

multi-subunit RNAPs (McAllister, 1993; Sousa, 1996). It starts

transcription by recognizing a specific DNA sequence termed

T7 promoter, a highly conserved 23-base pair (bp) sequence

from �17 to +6 relative to the initiation site for RNA synthesis

(Rosa, 1979). When the RNAP binds to its promoter DNA, it in-

duces promoter DNA bending and opening to generate a tran-

M

scription bubble from �4 to +2 that is stabilized by the initiating

nucleotide GTP (Figure 1A) (Tang and Patel, 2006a, 2006b; Ujvari

and Martin, 2000). In this initiation complex, where the upstream

promoter region is bound to the N-terminal domain of T7 RNAP

(Figures 1A and 1B), the growing RNA:DNA hybrid from 3 to

7 bps pushes against the N-terminal domain (Figure 1A, pink)

causing it to rotate by 40� (Figure 1B) (Bandwar et al., 2007;

Cheetham and Steitz, 1999; Durniak et al., 2008; Ma et al.,

2005; Mukherjee et al., 2002; Sousa et al., 1993; Tahirov et al.,

2002; Yin and Steitz, 2002). Ensemble fluorescence resonance

energy transfer (FRET) studies have measured stepwise DNA

scrunching and N-terminal domain rotation during initiation

(Tang et al., 2008, 2009). The final IC (initiation complex) to EC

(elongation complex) change involves a dramatic 220� rotation

that abrogates the upstream promoter interactions (Bandwar

et al., 2007; Cheetham and Steitz, 1999; Durniak et al., 2008;

Ma et al., 2005; Mukherjee et al., 2002; Sousa et al., 1993; Ta-

hirov et al., 2002; Yin and Steitz, 2002). In particular, the N-termi-

nal subdomain H (Figure 1C, green subdomain) refolds and

moves �70 A to the opposite end to become part of the RNA

channel in the EC. While the RNAP is in contact with the pro-

moter site, it goes through an abortive synthesis phase that re-

leases short RNAs of 2–13 nt in length (Figure 1D) (Brieba and

Sousa, 2001; Ikeda and Richardson, 1986; Jia and Patel,

1997a; Martin et al., 1988; Tang et al., 2005). Abortive cycling,

the process where RNAP restarts transcription after releasing

short RNA fragments, is a universal feature of transcription

(Carpousis and Gralla, 1980; Martin et al., 1988; Munson and

Reznikoff, 1981). In T7 RNAP, pushback from the rotation of

the N-terminal subdomain accompanying RNA synthesis could

destabilize the transcribing ternary RNAP-DNA complex,

thereby releasing the short RNAs as abortive products. Addition-

ally, persistent interactions between the N-terminal subdomain

(Figure 1A, pink) and the upstream promoter region affect the

abundance of the long abortives (11–13 nt in length) (Figure 1D)

(Tang et al., 2005). Weakening these interactions reduces long

abortives (Bandwar et al., 2006), whereas stable disulfide cross-

link between the two regions increases these products, without

affecting the short abortives (2–8 nt) (Esposito andMartin, 2004).

olecular Cell 70, 695–706, May 17, 2018 ª 2018 Elsevier Inc. 695

Page 3: Correlating Transcription Initiation and Conformational

Figure 1. Structures of T7 RNAP in the Initiation and Elongation Phases and Schematics to Study Single-Molecule Transcription Initiation

(A) T7 RNAP IC with 3-nt RNA (PDB 1QLN). The dotted lines show the approximate location of the downstream promoter DNA.

(B and C) IC (B) with 7-nt RNA (PDB 3E2E) and elongation complex (C) (PDB 1MSW). The DNA template and non-template strands are in green and red,

respectively. The C-terminal domain of T7 RNAP (sky blue) remains mostly unchanged, and the N-terminal subdomains (yellow, pink, and green) undergo major

conformational changes. The donor Cy3 (green ball) was attached to amino acid 174 in subdomain H and acceptor Cy5 (red hexagon) at +17 on the promoter DNA

in configurations 3 and 4. The donor-acceptor distance changes from �80 A in IC to �40 A in EC.

(D) The kinetics of transcription on the �21 to +19 phi10 promoter fragment at 25�C with [g32P]GTP and NTPs (0.5 mM), T7 RNAP (15 mM), and promoter DNA

(10 mM, reproduced from Tang et al., 2005).

(E) Immobilized DNA containing the T7 promoter sequence with the fluorophore locations marked. The biotinylated dsDNA was immobilized on PEG-coated

quartz surface. The yellow box displays the T7 promoter site recognized by the RNAP.

(F) Outline of the single-molecule fluorescence experiments and labeling configurations tomeasure the various steps of the transcription reaction by the T7RNAP.

Different colors in T7 RNAP represent various protein domains. The green domain is subdomain H showing a large movement during IC to EC transition.

Biochemical and structural studies have characterized pro-

moter clearance, initial bubble collapse (Gong et al., 2004; Liu

and Martin, 2002), and transition into elongation in RNAPs

(Bandwar et al., 2007; Cheetham and Steitz, 1999; Durniak

et al., 2008; Ma et al., 2005; Mukherjee et al., 2002; Sousa

et al., 1993; Tahirov et al., 2002; Yin and Steitz, 2002). Because

the crystal structures offer static views, and ensemble methods

provide an average picture of steps during transcription initiation

and transition to elongation, the real-time monitoring of these

processes at the level of a single RNAP is desirable (Herbert

et al., 2008; Kapanidis et al., 2006; Revyakin et al., 2006). For

example, there could be multiple pathways of abortive initiation

and transition to elongation. These pathwaysmay arise from sto-

chastic events during abortive initiation and depend on the

696 Molecular Cell 70, 695–706, May 17, 2018

changing interactions of the RNAP with the promoter DNA, but

ensemble studies cannot sort out kinetic branching between

such pathways. Additionally, we do not know whether every

transcription initiation event has to go through an abortive phase.

Single-molecule techniques (Bustamante, 2008; Tamarat et al.,

2000) can resolve hidden characteristics that are averaged out

in ensemble transcription assays (Friedman and Gelles, 2012;

Herbert et al., 2008; Zhang et al., 2014). Single-molecule FRET

and magnetic tweezers studies have demonstrated DNA open-

ing and scrunching during initiation (Duchi et al., 2018; Kapanidis

et al., 2006; Revyakin et al., 2006; Tang et al., 2008). Similarly,

optical tweezers studies detected RNAP’s elongation in single

base-pair steps (Abbondanzieri et al., 2005), pausing (Herbert

et al., 2006), and termination (Larson et al., 2008). However,

Page 4: Correlating Transcription Initiation and Conformational

the ability to monitor RNA synthesis and the accompanying

conformational changes of both the substrate DNA and RNAP

simultaneously during the various phases of transcription initia-

tion in real time and with a high resolution is lacking.

In this study, we placed fluorescent probes on the promoter

DNA and T7 RNAP and monitored the multistep reactions of

the transcription process in real time and with near single base

resolution using various single-molecule fluorescence methods

including protein-induced fluorescence enhancement, fluores-

cence lifetime and FRET (Ha, 2001; Ha et al., 1996). We moni-

tored the dynamics of each step including promoter binding,

closed to open transition, stepwise elongation of RNA, abortive

cycling, and transition into elongation by single RNAPmolecules.

We could quantify two kinetically distinct pathways of transcrip-

tion initiation and observe in real-time abortive events, transition

to elongation phase, and its intermediates. The unexpected

finding was that a majority of the productive initiation events

occurred with no detectable abortives, which is counterintuitive

but is consistent with probabilistic modeling.

RESULTS AND DISCUSSION

The double-stranded (ds) DNA (47 bp) containing the consensus

T7 promoter sequence was immobilized on a polyethylene glycol

(PEG)-coated quartz surface via neutravidin-biotin interaction,

and reactions were initiated by adding RNAP with or without

ribonucleoside 50-triphosphates (NTPs) (Figure 1E). We used

four different labeling configurations to probe the various steps

of transcription by T7RNAP on single DNAmolecules (Figure 1F).

In configuration 1, Cy3 fluorophore was conjugated to the non-

template strand at position �4, which is at the upstream edge

of the initiation bubble in the promoter site (transcription start

site of the position +1). Measurement of fluorescence intensity

and the lifetime of Cy3 in configuration 1 monitors both promoter

binding and melting by RNAP (Sorokina et al., 2009). In configu-

ration 2, Cy3 and Cy5 were conjugated to position �4 of the

non-template strand and position +17 of the template strand,

respectively, so that FRET between them reports on DNA

bending/scrunching during transcription initiation (Tang et al.,

2008). In configuration 3, RNAP (V174C mutant) was labeled

with Cy3 at its subdomain H (Cy3-RNAP) and DNA was labeled

with Cy5 at position +17 of the template strand (Figures

1A–1C). Thus, FRET reports on the significant relocation of sub-

domain H that occurs during the IC-to-EC transition (Cheetham

and Steitz, 1999; Sousa et al., 1993; Tahirov et al., 2002; Yin and

Steitz, 2002). Finally, configuration 4 combines configurations

2 and 3, where Cy3-RNAP was used with Cy3-Cy5 doubly

labeled DNA, simultaneously reporting on transcription initiation

and IC-to-EC transition.

RNAP Binding and Promoter OpeningFirst, we measured the single-molecule kinetics of T7 RNAP

binding to Cy3-labeled DNA in configuration 1. The fluorescence

intensity of Cy3 increases by its proximity to a protein (Hwang

and Myong, 2014; Luo et al., 2007; Myong et al., 2009). Before

the addition of the RNAP, the intensity of Cy3 at position �4 on

the non-template strand (configuration 1) remained low and

constant over time, but upon adding T7 RNAP, the Cy3 signal

fluctuated among three intensity levels; low, mid, and high (Fig-

ures 2A and 2B). The low-intensity state comes fromCy3-labeled

DNA by itself (U), and the mid/high-intensity states arise from the

RNAP-bound DNA species (B). We measured the single-mole-

cule fluorescence lifetimes using time-correlated single photon

counting to test our states assignment further (Figure 2C). The

fluorescence lifetime of Cy3, which is the average time between

its excitation and fluorescence photon emission, is sensitive to

the local environment and has distinct values for closed and

open complexes (Sorokina et al., 2009). The lifetime of the low-

intensity state is the shortest and matches the lifetime of DNA

alone, �0.8 ns (Figure 2D) (Sorokina et al., 2009). The lifetime

of the high-intensity state,�1.3 ns (Figures 2C and 2D), matches

the lifetimes of both the open complex bound to the initiating

NTP, 30-dGTP (Sorokina et al., 2009; Stano et al., 2002) and

the pre-melted DNA-RNAP (mismatches from �4 to +2) (Soro-

kina et al., 2009). Therefore, we assign the low-intensity state

to the DNA-only state and the high-intensity state to the open

complex with a transcription bubble. We putatively assign the

mid intensity state with an intermediate lifetime to the RNAP-

bound closed complex, where the DNA is not open, but the

DNA is bent (see below).

The closed and open complexes (state C and state O, respec-

tively) in the time traces were grouped as the RNAP-bound state

B (blue box), and the DNA only is the unbound state U

(yellow box) (Figure 2B). Dwell-time analysis of the U and B

states provided the kinetics of RNAP binding and dissociation

from the DNA, respectively (Figures 2E and 2F). The inverse of

the average dwell time of U depended linearly on RNAP concen-

tration with a slope of 5.66 (±0.16) 3 107 M–1s–1 (Figure 2E),

which is the kon rate constant of RNAP-DNA complex formation.

It is only three times slower than the previous estimate in bulk

(Tang and Patel, 2006b), suggesting that Cy3 labeling at

the �4 base or surface immobilization of the RNAP-DNA com-

plex has only moderate impacts on the RNAP-DNA interaction.

The inverse of the average dwell time of B did not depend on

the RNAP concentration and provided an apparent T7 RNAP

dissociation rate of 1.2 s–1 (Figure 2F).

Within the RNAP-bound B state, two different intensity states,

state C (mid-intensity) and state O (high-intensity) exist (Fig-

ure 2B). The dwell time shows that the state C is more stable

than the open state O (Figures 2G and 2H). The average C dwell

time provides the promoter opening rate of 3.9 ± 0.3 s–1 (Fig-

ure 2H). The average O dwell time was shorter than our time

resolution of 16 ms for time traces in Figures 2A and 2B (corre-

sponding to a rate constant of 62.5 s–1) (Figure 2G), and consis-

tent with the fast rate of promoter closing of 125–150 s–1

measured by stopped-flow kinetics (Stano et al., 2002; Tang

and Patel, 2006a). The fluorescence intensity histogram pro-

vided an O to C ratio of 0.28 (Figure S1A), which is also consis-

tent with the 30%–35% of the open complex observed in bulk

experiments without initiating NTP (Bandwar and Patel, 2001;

Villemain et al., 1997), further supporting our states assignment.

Next, we measured the FRET from the donor (Cy3) and

acceptor (Cy5) intensities of the doubly labeled DNA (configura-

tion 2) upon addition of RNAP (Figures 2I–2K). The apparent

FRET efficiency was the same (�0.3) for all of the species

(U, C, and O), because the quantum yield of Cy3 is affected by

Molecular Cell 70, 695–706, May 17, 2018 697

Page 5: Correlating Transcription Initiation and Conformational

Figure 2. Dynamics of RNAP Binding and

DNA Opening

(A) Real-time observation of T7 RNAP binding to

the promoter DNA and DNA opening. Time traces

of Cy3 intensity measured in configuration 1 show

increases in fluorescence intensity correlated with

the RNAP binding events. Dotted lines are guides

for three different intensity levels.

(B) A zoom-in of the time trace is displayed in the

red box of (A). B and U represent RNAP-bound

and RNAP-unbound states, respectively.

(C) Time traces of fluorescence intensity (top) and

lifetime (bottom) of Cy3 (10 nM RNAP).

(D) Fluorescence lifetime distributions of DNA only

(top), DNA with 10 nM RNAP (middle), and DNA

with 10 nM RNAP and 1mM 30-dGTP. The red and

blue lines show the average lifetimes of O and C

states, respectively.

(E) The binding rate (mean ± SEM) of the RNAP to

the DNA increases linearly with the RNAP con-

centration with a slope of 5.66 (±0.16) 3 107

M–1s–1 corresponding to the kon rate constant.

(F) The RNAP dissociation rates (mean ± SEM)

from the DNA (�1.2 s–1) plotted against the RNAP

concentration.

(G) Dwell-time histogram of state (O) fit to a single

exponential fit estimating the promoter closing

rate constant (>62.5 s–1).

(H) Dwell-time histogram of state (C) and a single

exponential fit providing the promoter opening

rate constant (3.9 s–1).

(I) FRET from the donor (Cy3, green) and acceptor

(Cy5, red)-labeled promoter DNA upon RNAP

binding (configuration 2). Diagram showing

sequential DNA bending caused by RNAP binding,

DNA opening, and initial NTP binding steps.

(J–M) Corrected FRET efficiency histograms from

(J) DNA alone, (K) closed complex, (L) open com-

plex, and (M) IC stalled at +1/+2.

See also Figures S1 and S2.

the RNAP binding and promoter opening steps, and this masks

the change in FRET efficiency. We, therefore, calculated the cor-

rected FRET efficiency (Ecorr) using the relative quantum yields of

Cy3 and Cy5 and the total fluorescence intensity, Itotal (Figures

S1B and S1C). The corrected FRET efficiency histograms distin-

guished the three species (U, C, and O) and provided additional

information about DNA bending (Figures 2I–2L). Ecorr is the

lowest for DNA alone (Figure 2I) and increases from 0.15 /

0.22 upon RNAP binding to DNA in the closed complex (Fig-

ure 2J). The higher FRET value of DNA in the C state implies

that the DNA is bent in the closed complex (Figure 2J), which

has not been observed previously because bulk studies could

not resolve DNA bending from DNA opening (Tang and Patel,

2006a). The Ecorr increases further (0.22 / 0.28) when the

RNAP transitions to the open complex (O) (Figure 2K), indicating

698 Molecular Cell 70, 695–706, May 17, 2018

additional DNA bending with transcrip-

tional bubble formation (Tang and Patel,

2006a). Interestingly, when the initiating

NTP (30-dGTP) was added, we observed

an additional increase in Ecorr (0.28 /

0.47) (Figure 2L), which indicates further bending of the DNA in

the IC bound to NTP at position +1/+2 (Tang and Patel, 2006a,

2006b). Figure 2M summarizes the proposed conformational

changes in the promoter DNA caused by RNAP and 30-dGTP

binding based on the sequential increase of FRET efficiency at

each step.

DNA Scrunching/Bending in the ICNext, we measured the FRET in the doubly labeled DNA (config-

uration 2) during initial RNA synthesis (Figure 3A). We stalled

transcription at defined positions using a limiting set of NTPs

and 30dNTP analogs. The quantum yield of Cy3 did not change

between these stalled positions, making the quantum yield

correction of FRET efficiency unnecessary (Figure S2A). The un-

corrected FRET efficiency, E, increased sequentially from the

Page 6: Correlating Transcription Initiation and Conformational

stalled position +1/+2 to position +6: 0.3 for DNA only, 0.43 at

position +1/+2 with 30-dGTP, 0.5 at position +3 with GTP, and

0.65 both at the position +4 and +6, with GTP+30-dATP and

GTP+ATP, respectively (Figure 3A). We previously reported

that these increases in E are caused by a combination of step-

wise scrunching and bending of the DNA, which brings Cy5 at

position +17 closer to Cy3 at position �4 (Durniak et al., 2008;

Tang et al., 2008).

A single high FRET population (E = 0.65) at +6 at saturating

concentrations of GTP (1 mM) and ATP (0.5 mM) indicates that

the RNAP rapidly synthesize the 6-mer RNA and stays at the

position +6 until 6-mer RNA release (Margeat et al., 2006; Tang

et al., 2008). To measure the real-time transcription reaction up

to 6-mer synthesis, we used sub-saturating NTP concentrations

(20 mM GTP and 10 mM ATP). We could discern four distinct

states in single-molecule time traces, transitioning from E = 0.3

through 0.43 and 0.5 to 0.65 (Figure 3B). TheseE valuesmatched

those of the E histograms of DNA only (E = 0.3), RNAP stalled at

position +1/+2 (E = 0.43), position +3 (E = 0.5) and positions +4

to +6 (E = 0.65). Thus, the single-molecule time course repre-

sents the real-time observation of transcription initiation by a sin-

gle T7 RNAP with near base-pair resolution. The steep drop of E

from the highest E state in Figure 3B is due to RNA release as an

abortive product, as we will discuss below.

To determine the average rate of each RNA lengthening

step, we carried out dwell-time analysis of each state

(DNA, +1/+2, +3, +4 to +6). The dwell-time histograms displayed

a peaked distribution instead of a simple exponential, suggest-

ing that there ismore than one rate-limiting stepwithin each state

(Figure 3C). At 20 mM GTP, 10 mM ATP, and 10 nM RNAP, the

average rates of exiting the states with E = 0.3, 0.43, 0.5, and

0.65 were 0.26 (±0.08), 1.53 (±0.34), 0.71 (±0.14), and 0.07

(±0.01) s–1, respectively These exit rates of the E = 0.3 and

0.43 states are very similar to the measured rate constants of

2-mer and 3-mer RNA synthesis by pre-steady-state kinetics

at 20 mM GTP (Jia and Patel, 1997b).

Abortive Cycling PathwayAs described above, during transcription initiation in the pres-

ence of GTP and ATP, where transcription cannot continue

past the +6 position, there was an abrupt drop in E after a delay

(Figure 3B). The sudden drop in E cannot be caused by the tran-

sition to EC because structural and biochemical studies indicate

that transition to EC is not expected at position +6 and occurs

between 8 and 12 nt synthesis (Bandwar et al., 2006). Instead,

our analysis below indicates that the decrease in E is due to

RNA release. Thus, the repetitive cycles of stepwise increase

in E and a sudden decrease represent real-time monitoring of

the abortive cycling of the RNAP during transcription initiation

(Figures 4A and 4B). We observed two types of abortive cycling:

one, where the RNAP stays bound to the DNA even after RNA

release and restarts another round of RNA synthesis (RNAP re-

cycling), and, two, where the RNAP dissociates from the DNA

with or after RNA release, so that the next round begins only after

the binding by another RNAP (RNAP exchange) (Figure 4C).

Figure 4A shows an example of the RNAP exchange pathway

(green circle in Figure 4C). Herein, we observed repetitive transi-

tions of E from 0.3 to 0.65 (DNA scrunching/bending during initial

transcription to position +6), and then to 0.43 (red line) (relaxation

of scrunched/bent DNA upon RNA release), and then to 0.3

(black line) (RNAP dissociation). A large drop in Itotal (blue trace)

that reverses protein-induced fluorescence enhancement of Cy3

accompanies the E = 0.3 state, consistent with RNAP dissocia-

tion. The dwell-time analysis provided additional evidence for the

RNAP exchange pathway. If RNAP is dissociating, we expect the

exit rate of the E = 0.3 state to depend on the concentration of

RNAP (KD �10 nM) and not on the concentration of GTP, which

is what we observed (Figures 4D and 4E). The above shows that

the E = 0.3 state is the DNA-only state resulting from RNAP

dissociation accompanying RNA release.

Figure 4B shows an example of the RNAP recycling pathway

(orange circle in Figure 4C). Herein E increases from 0.43 to

0.65 (initial transcription), drops to 0.43 (RNA release), and in-

creases again, suggesting that transcription reinitiates without

RNAP dissociation (no visits to the E = 0.3 state). As expected

for a non-dissociating RNAP, the average dwell time of the

E = 0.43 state depends on the concentration of GTP (the first

three NTPs to be incorporated in the RNA), but not on the

RNAP concentration (Figures 4F and 4G). These results support

our interpretation that the E = 0.43 state is the RNAP-bound

state. Also, the Itotal does not drop significantly, suggesting

that the transcription bubble is intact in this intermediate. If the

bubble had collapsed, a reduction in Cy3 quantum yield would

have changed the total fluorescence intensity more significantly.

We also observed from the same DNA molecule both the

RNAP exchange and RNAP recycling pathways (Figure S3A),

indicating that one or the other abortive cycling pathway is not

due to a chemical defect in single DNA molecules. The partition-

ing between the RNAP exchange and RNAP recycling de-

pended, as expected, on the concentration of GTP. At higher

concentrations of GTP, the RNAP recycling pathway was

preferred (Figure S3B) likely because of kinetic competition be-

tween RNA synthesis and RNAP dissociation.

RNAP Conformational Changes during Transition to ECT7 RNAP undergoes major conformational changes during IC to

EC transition (Figures 1A–1C) (Durniak et al., 2008; Tang et al.,

2008; Yin and Steitz, 2002), including translation of the subdo-

main H by �70 A toward the active site of T7 RNAP to become

a part of the RNA exit tunnel. We prepared a functionally active

cysteine-light RNAP, reported earlier (Mukherjee et al., 2002),

and introduced a cysteine at V174 within the subdomain H and

labeled it with Cy3 (Figure S4A). The DNA was labeled with

Cy5 at position +17 of the template strand (configuration 3,

Figure 5A).

First, we examined E histograms of RNAP complexes in

configuration 3 stalled at positions +1, +4, +6, +7, +8, +9,

and +12 after a reaction time of 1 min (Figure S4B). We observed

a single population (E�0.1) for positions +1, +4, +6, and +7, two

populations, low and mid E (E �0.1 and 0.4) at positions +8

and +9, and a single high E population (E �0.8) at position +12

(Figures 5B and S4B). The low E values observed at positions +1

through +7 are indicative of a large distance between

position +17 on the DNA and V174C on the RNAP, as shown in

Figures 1A and 1B, consistent with the crystal structures of IC3

and IC7 (Durniak et al., 2008). The high E value of the population

Molecular Cell 70, 695–706, May 17, 2018 699

Page 7: Correlating Transcription Initiation and Conformational

Figure 3. Real-Time Observation of Transcription Initiation with near Base-Pair Resolution

(A and B) E histograms in configuration 2 at +1/+2, +3, +4, and +6 transcriptional stall positions (A) and representative time traces of transcription reaction

(20 mM GTP and 10 mM ATP) stalling at position +6 (B). Colored horizontal bands mark different FRET states.

(C) Dwell-time distributions of four distinct FRET states—0.3, 0.43, 0.5, and 0.65—resolved as shown in (B) and fits a gamma distribution function, tn–1e–kt, from

which we calculated the average rates, k/n, of leaving each FRET state.

(D) Each RNA lengthening rate up to 6-nt RNA synthesis and estimation on the rate of abortive product release at each position.

at position +12 is consistent with the completion of IC to EC tran-

sition at +12 (Bandwar et al., 2006; Tang et al., 2009) that brings

the Cy3-labeled subdomain H toward the active site as shown in

Figure 1C. Therefore, our labeling configuration can reliably

detect changes in FRET that are expectedwith the RNAP confor-

mational changes associated with the IC to EC transition.

At positions +8 and +9, the mid E population appeared over

time at the expense of the low E population. The rate was

0.09 min–1 at position +8 and 1.1 min–1 at position +9 (Figures

S5A–S5D). Experiments performed using configuration 2

showed that RNAP complexes stalled at position +8 undergo un-

scrunching/unbending as observed through E decrease over a

similar timescale (Figure S5E). Therefore, RNAP complexes

stalled at +8 and +9 positions are capable of acquiring a struc-

ture where the H subdomain moves over, and DNA scrunch-

ing/bending partially reverses. However, this intermediate

structure appears to be distinct from EC because the Itotal of

700 Molecular Cell 70, 695–706, May 17, 2018

the two E populations at position +8 observed using configura-

tion 2 are comparable (Figures S5F and S5G), meaning that pro-

tein proximity to Cy3 on �4 position is the same. Therefore, the

mid E state at positions +8 and +9 observed using configuration

3 appears to be a new state, which has undergone a partial

conformational change to bring the subdomain H closer to

the +17 position on the DNA while maintaining interactions be-

tween the RNAP and promoter.

Next, we monitored IC to EC transition in real time in configu-

ration 3 with all four NTPs present at saturating concentrations

(1 mM GTP and 0.5 mM ATP, UTP, and CTP) (Figure 5C). The

time trace was partitioned into three segments – IC (pink), tran-

sition (yellow), and EC (blue) (Figure 5C). The low E state is the

IC segment lasting from +1 to +7, the steep increase of E is the

transition segment when the structural changes in RNAP occur,

and the high E state is the EC segment. In the presence of all

NTPs,�10%of time traces showed themid E state (Figure S5H).

Page 8: Correlating Transcription Initiation and Conformational

Figure 4. Abortive Cycling Pathways: RNAP Exchange versus RNAP Recycling

(A and B) Two representative time traces showing abortive cycling pathways of transcription reaction stalling at +6 (20 mMGTP and 10 mMATP) measured using

configuration 2. Green (Cy3 intensity), red (Cy5 intensity), blue (total intensity), and black (E). In the time trace shown in (A), repetitive drops to both E �0.43

(red line) and E = 0.3 state (black line) are observed. In the time trace shown in (B), only repetitive drops to E�0.43 (red line) are observed. The drop in fluorescence

at the very end of the trace at the top panel is due to photobleaching.

(C) The abortive cycling pathways via RNAP exchange (green circle) and RNAP recycling (orange circle).

(D) The exit rate (mean ± SEM) of E = 0.3 state versus [RNAP].

(E) The exit rate (mean ± SEM) of E = 0.3 versus [GTP].

(F and G) The exit rate (mean ± SEM) of E = 0.43 state versus [RNAP] (F) and [GTP] (G).

See also Figure S3.

Dwell-time analysis showed that the IC segment is about 1.2 s

in duration, and this represents the time interval between RNAP

binding and the transition to EC (Figure 5D), which may also

include abortive cycling without RNAP dissociation. This phase

has a duration similar to the 1.3–4.6 s of lag time between

RNAP binding and the beginning of the elongation phase, deter-

mined both using optical tweezers (Skinner et al., 2004) and the

pre-steady-state kinetics of transcription initiation (Jia and Patel,

1997a; Tang et al., 2009). However, this is slower than �0.2 s of

lag time between RNAP binding and the beginning of the elonga-

tion phase obtained by fast fluorescence in situ hybridization

method at 37�C probably because of the temperature difference

(Zhang et al., 2014).

Dwell-time analysis of the transition segment showed fast and

slow initiation populations transitioning into EC (Figure 5E). The

average rate constants of the two distinct populations during

the transition are 18.4 and 4.8 s–1 (Figure 5E). We hypothesize

that the fast and slow populations of initiating complexes arise

from a kinetic branch after 8 nt synthesis. The slower population

might be related to the mid E state detected at positions +8

and +9, which maintains promoter interactions and therefore

transitions to EC more slowly. In ensemble studies, the popula-

tion that maintains upstream promoter interactions ends up

making 11–13 long abortives (Bandwar et al., 2006; Esposito

and Martin, 2004). The faster population would be the IC com-

plex that clears the promoter and transitions to EC quickly

(Bandwar et al., 2007). The biological significance of the fast

and slow populations escaping to elongation is not apparent,

but this mechanism could aid in the initiation of DNA replication.

T7 DNA polymerase uses a T7 RNAP-made RNA transcript

to initiate DNA replication (Richardson, 1983). This transcript

must be handed over from T7 RNAP to the DNA polymerase,

andwe speculate that the slower rate of transition into elongation

would facilitate this process.

Dwell-time analysis of the EC segment showed a relatively

long duration of 1.2 s, which represents the time it takes for

the RNAP to complete the synthesis of RNA after the IC to EC

transition and dissociate from the DNA. The elongation rate of

T7 RNAP is fast (40–200 nt/s) (Anand and Patel, 2006; Kim and

Larson, 2007; Skinner et al., 2004); hence, short RNAs (14 nt

from +12 to +25) are synthesized quickly. The relatively long

dwell time in the EC segment suggests that the RNAP does

Molecular Cell 70, 695–706, May 17, 2018 701

Page 9: Correlating Transcription Initiation and Conformational

Figure 5. Conformational Change of the RNAP during Transition from IC to EC

(A) In configuration 3, the IC is expected to show a low FRET between Cy3 at aa 174 in T7 RNAP and Cy5 in the promoter at +17, and the elongation complex (EC),

a high FRET based on the crystal structure (Tahirov et al., 2002; Yin and Steitz, 2002).

(B) FRET efficiency histogram of the complex stalled at +1, +9, and +12 transcriptional positions shows an increase in FRET efficiency from initiation to elongation.

(C) Representative time trace of the transcription reaction in the presence of all NTPs showing the intensity and FRET changes accompanying the conformational

changes of RNAP during the transition from IC to EC divided into three sections—IC, transition, and EC.

(D–F) Dwell-time analysis of initiation (D), transition (E), and elongation (F) sections are shown in (C). Fits to gamma distribution functions (two gamma functions for

E) are shown in red.

See also Figures S4 and S5.

not dissociate from the promoter DNA immediately after

completing the RNA synthesis.

Failed versus Productive InitiationWith the tools in place to probe the various steps of transcription,

we set out to investigate whether we could observe a full tran-

scription cycle, in particular, visualize both abortive initiation

and transition into elongation in a single experiment. To take

advantage of both labeling configurations 2 and 3, we designed

a 3-probe experiment with Cy3-labeled RNAP and dual-labeled

DNA (configuration 4, Figure 6A). This 3-probe system can report

on DNA scrunching/bending during abortive initiation and

the conformational change of RNAP during the transition to

elongation.

Figure 6B shows the anticipated time traces from the 3-probe

system, as simulated by averaging the time traces of configura-

tions 2 and 3. The simulated time traces in blue are from config-

uration 2 and in sky blue from configuration 3. The FRET values

at each transcriptional stall position are based on the FRET histo-

grams (Figures 3A, S4B, and S5E). We calculated the simulated

702 Molecular Cell 70, 695–706, May 17, 2018

time traces of the 3-probe experiments (configuration 4) by

averaging the expected FRET efficiencies from experiments in

configurations 2 and 3. We expect, as shown in the black line

in the upper panel of Figure 6B, that the apparent FRET effi-

ciency, E, will decrease when the Cy3-labeled RNAP binds to

the dual-labeled DNA (blue star). This decrease is from the pres-

ence of an additional donor on the RNAP that does not contribute

significantly to FRET in the IC. As initial transcription proceeds,

E will increase progressively due to DNA bending and scrunch-

ing. Finally, a large increase in E will occur when RNAP transi-

tions to EC and undergoes a conformational change that brings

Cy3 on the RNAP close to Cy5 at position +17 of the DNA. When

the RNAP dissociates after producing the full transcript, E will

drop to the initial value of the DNA-only species.

Figure 6B lower panel shows the anticipated time traces from

the 3-probe system, where transcription initiation contains abor-

tive synthesis. We expect that the apparent FRET efficiency,

E, will decrease when the Cy3-labeled RNAP binds to the dual-

labeled DNA (blue star). As initial transcription proceeds, E will

increase progressively due to DNA bending and scrunching

Page 10: Correlating Transcription Initiation and Conformational

Figure 6. Real-Time Observation of the

Entire Transcription Reaction Including the

Direct Observation of Abortive Cycling

(A) The labeling scheme of configuration 2: Cy3

and Cy5 on DNA with wild-type RNAP, configu-

ration 3: Cy5-labeled DNA andCy3-labeled RNAP,

and configuration 4: Cy3 and Cy5 on DNA and

Cy3-labeled RNAP.

(B) Expected time traces of apparent FRET effi-

ciency (black) from the three-probe experiment

were simulated by averaging the time traces of

configuration 2 (blue) and configuration 3 (sky

blue). The top panel shows a simulation for a

productive transcription reaction without abortive

synthesis, and the bottom panel shows a simula-

tion for a productive transcription reaction with

repetitive abortive synthesis. The blue and red

asterisks mark the moments of RNAP binding and

abortion, respectively.

(C–E) Representative time traces of the three-

probe experiments in the presence of all NTPs

showing (C) productive initiation without any

abortion, (D) failed initiation without transition to

EC, and (E) productive initiation with multiple

rounds of abortive initiation. The top panel shows

the total intensity (blue), the middle-panel shows

the intensity of Cy3 (green) and Cy5 (red), and the

bottom-panel shows the FRET efficiency. RNAP

binding is marked with a blue star and abortion

with a red star.

(F) The percentage (mean ± SEM) of failed and

productive initiations for two different promoters,

class III consensus promoter (promoter 1) and

class II T7 promoter ø1.3 (promoter 2). *p < 0.05

(two-tailed unpaired t test).

(G and H) The number of the abortive events per

RNAP (mean ± SEM) in failed (G) and productive

(H) initiation events.

but show repeated drops to lower E (red star) due to abortive

synthesis as observed above in Figure 4A. The above events

could be followed by a large increase in E when the RNAP tran-

sitions to EC as observed in Figure 5C. Finally, the efficiency E

will drop to the initial value of the DNA-only species when the

RNAP dissociates after producing the full transcript.

Representative single-molecule time traces show three typical

examples of transcription events observed with the 3-probe sys-

tem. Some RNAP molecules made the transition to EC without

any abortive event (Figure 6C). Some experienced one or more

rounds of abortive cycling, marked by red stars, without ever

making a transition to EC (Figure 6D), whereas other RNAPs

eventually transited to the elongation phase after one or more

rounds of abortive cycling (Figure 6E). The representative sin-

gle-molecule time shown in Figure 6C is composed of RNAP

binding, initiation, the transition to EC, and RNAP release steps.

Additionally, RNAP binding and dissociation events were clearly

detected via the sudden increase and decrease of the total inten-

sity, respectively (Figures 6C–6E, blue lines).

We classified the transcription events into two categories

based on the outcome: failed initiation if the molecule experi-

ences the transcriptional initiation and fails to transit to the elon-

gation phase (Figure 6D), and productive initiation if the IC to EC

transition occurred (Figures 6C and 6E). The population of failed

initiation described here might be similar to the unproductive

initiation reported from ensemble studies of the bacterial RNAP

(Vo et al., 2003). On the stronger class III consensus promoter

(promoter 1) with which we performed all experiments using

configurations 1, 2, and 3, 72.4% of the RNAPs (of 650 total mol-

ecules) that initiated transcription showed productive initiation,

and 27.6% failed initiation (Figure 6F). Note that we assumed

that all molecules showing the conformational change of the

N-terminal subdomain H would generate run-off products. To

determine whether the branching ratio between the two out-

comes is dependent on the promoter strength, we carried out

experiments with a weaker T7 promoter. The weaker class II

T7 promoter ø1.3 (promoter 2) had the RNAP binding frequency

that is half as high as the binding frequency of promoter 1 (0.08 ±

0.01 versus 0.18 ± 0.02min–1). Promoter 2 also showed less pro-

ductive initiation (52.2% ± 8.1% among �250 molecules in Fig-

ure 6), consistent with ensemble data (Bandwar et al., 2006;

Ikeda, 1992; Lopez et al., 1997).

We counted the number of abortive events in the productive

initiation and failed initiation events in the consensus promoter.

Molecular Cell 70, 695–706, May 17, 2018 703

Page 11: Correlating Transcription Initiation and Conformational

Figure 7. Probabilistic Model Predicts the Outcomes of Transcription Initiation

(A) Probabilistic model is showing that each incidence of open state formation can result in a transition to elongation (E), RNAP dissociation (D), and abortive

product release (A). After an abortive product release (A), the RNAP-DNA complex returns to the open state (O), and the three different paths from (A) start again,

and so on. RNAP that does not experience any initiation is shown as D* to distinguish from the dissociation events of RNAP at the initiation stage.

(B) Abortive to run-off ratio was calculated from the experiment shown in the ensemble gel assay by Tang et al. (2005) on the Phi10 promoter. The error bars

represent SD from 3measurements. The abortive to run-off ratio is 1.33 when all abortive products (2–13-mer) are included is reduced to 0.92 ± 0.06 when 2-mer

products are excluded and is further reduced to 0.75 ± 0.04 when 12–13-mer products are also excluded. The dashed red line denotes the abortive to run-off ratio

predicted by single-molecule experiments.

Interestingly, in the productive initiation, 76.8% ± 1.0% of RNAP

did not undergo any detectable abortion, 17.6% ± 1.3% one

abortive event, and �5% underwent 2 or 3 abortive events. In

failed initiation, 76.5% ± 2.1% of RNAP underwent one abortion,

16.4% ± 1.5% two abortive events, and �7% 3–4 abortive

events (Figures 6G and 6H).

We developed a probabilistic model to explain the various

outcomes of transcription initiation (Figure 7A; see also Supple-

mental Information). Briefly, each incidence of open state

formation can result in the transition to elongation (E), RNAP

dissociation (D), or abortive product release (A). The outcome

A brings about another branching point into E, D, or A and so

on. Here, we assume that there is no memory effect such that

each branch of outcome occurs with history-independent prob-

ability (PE, PD, and PA). Using the experimental data shown in

Figures 6F–6H, for promoter 1, we obtained PA = 0.22, PD =

0.5, and PE as 0.28 (Supplemental Information). We cannot

distinguish between photobleaching of Cy3 and RNAP disso-

ciation; therefore, PD should include a contribution from photo-

bleaching. However, photobleaching does not change the

partitioning ratio between E and A, and the agreement with

gel-based quantification of abortive and run-off products ratio

(see below) suggests that photobleaching has only moderate

effects on the estimated probability values.

We verified these probabilities by comparing the calculated ra-

tio of abortive products to run-off products from ourmodel to the

values obtained from ensemble transcription reactions analyzed

by the gel assay that resolves 2-mer to 19-mer run-off (Supple-

mental Information; Figure 7B). We found a good agreement

between the abortive to run-off ratios from the single-molecule

measurements (0.79) and from the ensemble measurements

(0.75–0.9), after correcting for the fact that single-molecule mea-

surements do not detect 2-mer abortives.

This model and the estimated probability of each branch

enable us to highlight unanticipated and exciting conse-

quences. (1) A significant fraction of the total initiation events

(56% on promoter 1 and 41% on promoter 2) can make it to

704 Molecular Cell 70, 695–706, May 17, 2018

elongation without detectable abortive events. (2) The failed

initiations produce twice the number of abortive events as the

ones from productive initiation, even though the failed popula-

tion is only one-third as large as the productive population. The

counterintuitive finding that the majority of the successful

transcription produce no abortive products, however, can be

explained with probabilistic modeling without invoking RNAP

heterogeneity in branching ratio between productive and failed

initiation.

The methods and findings from this study should be widely

applicable to other single-subunit and multi-subunit RNAPs.

For multi-subunit and single-subunit mitochondrial RNAPs,

the transition to EC can be monitored by the translocation of

its cofactor, instead of the subdomain movement observed

here. Therefore, the approaches we developed here would be

applicable for RNAPs by monitoring the transcription initiation

from DNA conformational change and the transition to EC

from the distance changes between the cofactor and the

DNA promoter.

STAR+METHODS

Detailed methods are provided in the online version of this paper

and include the following:

d KEY RESOURCES TABLE

d CONTACT FOR REAGENT AND RESOURCE SHARING

d METHOD DETAILS

B TIRFM and single-molecule FRET imaging

B Single-molecule fluorescence lifetime measurement

B DNA sequences, labeling, and annealing

B Expression, purificationm and labeling of Cys-light

RNAP with V174C mutation

d QUANTIFICATION AND STATISTICAL ANALYSIS

B Single-molecule data analysis

B Gamma factor correction

B Model of abortive initiation

Page 12: Correlating Transcription Initiation and Conformational

SUPPLEMENTAL INFORMATION

Supplemental Information includes five figures and can be found with this

article online at https://doi.org/10.1016/j.molcel.2018.04.018.

ACKNOWLEDGMENTS

This work was supported by NIH grants (GM118086, GM122569) and by the

Korea Research Foundation (KRF-2006-612-C0020).

AUTHOR CONTRIBUTIONS

Conceptualization, H.R.K., S.S.P., and T.H.; Investigation, H.R.K., R.R., M.S.,

G.-Q.T., and D.N.; Resources, S.S.P. and T.H.; Writing – Original Draft, H.R.K.;

Writing – Review & Editing, H.R.K., S.S.P., and T.H.; Supervision, S.S.P. and

T.H.; Funding Acquisition, H.R.K., S.S.P., and T.H.

DECLARATION OF INTERESTS

The authors declare no competing interests.

Received: December 5, 2017

Revised: February 23, 2018

Accepted: April 19, 2018

Published: May 17, 2018

REFERENCES

Abbondanzieri, E.A., Greenleaf, W.J., Shaevitz, J.W., Landick, R., and Block,

S.M. (2005). Direct observation of base-pair stepping by RNA polymerase.

Nature 438, 460–465.

Anand, V.S., and Patel, S.S. (2006). Transient state kinetics of transcription

elongation by T7 RNA polymerase. J. Biol. Chem. 281, 35677–35685.

Bandwar, R.P., and Patel, S.S. (2001). Peculiar 2-aminopurine fluorescence

monitors the dynamics of open complex formation by bacteriophage T7

RNA polymerase. J. Biol. Chem. 276, 14075–14082.

Bandwar, R.P., Tang, G.Q., and Patel, S.S. (2006). Sequential release of pro-

moter contacts during transcription initiation to elongation transition. J. Mol.

Biol. 360, 466–483.

Bandwar, R.P., Ma, N., Emanuel, S.A., Anikin, M., Vassylyev, D.G., Patel,

S.S., and McAllister, W.T. (2007). The transition to an elongation complex

by T7 RNA polymerase is a multistep process. J. Biol. Chem. 282,

22879–22886.

Brieba, L.G., and Sousa, R. (2001). T7 promoter release mediated by DNA

scrunching. EMBO J. 20, 6826–6835.

Bustamante, C. (2008). In singulo biochemistry: When less is more. Annu. Rev.

Biochem. 77, 45–50.

Carpousis, A.J., andGralla, J.D. (1980). Cycling of ribonucleic acid polymerase

to produce oligonucleotides during initiation in vitro at the lac UV5 promoter.

Biochemistry 19, 3245–3253.

Cheetham, G.M., and Steitz, T.A. (1999). Structure of a transcribing T7 RNA

polymerase initiation complex. Science 286, 2305–2309.

Duchi, D., Gryte, K., Robb, N.C., Morichaud, Z., Sheppard, C., Brodolin, K.,

Wigneshweraraj, S., and Kapanidis, A.N. (2018). Conformational heterogeneity

and bubble dynamics in single bacterial transcription initiation complexes.

Nucleic Acids Res. 46, 677–688.

Durniak, K.J., Bailey, S., and Steitz, T.A. (2008). The structure of a transcribing

T7 RNA polymerase in transition from initiation to elongation. Science 322,

553–557.

Esposito, E.A., and Martin, C.T. (2004). Cross-linking of promoter DNA to T7

RNA polymerase does not prevent formation of a stable elongation complex.

J. Biol. Chem. 279, 44270–44276.

Friedman, L.J., and Gelles, J. (2012). Mechanism of transcription initiation at

an activator-dependent promoter defined by single-molecule observation.

Cell 148, 679–689.

Gong, P., Esposito, E.A., andMartin, C.T. (2004). Initial bubble collapse plays a

key role in the transition to elongation in T7 RNA polymerase. J. Biol. Chem.

279, 44277–44285.

Ha, T. (2001). Single-molecule fluorescence resonance energy transfer.

Methods 25, 78–86.

Ha, T., Enderle, T., Ogletree, D.F., Chemla, D.S., Selvin, P.R., and Weiss, S.

(1996). Probing the interaction between two single molecules: Fluorescence

resonance energy transfer between a single donor and a single acceptor.

Proc. Natl. Acad. Sci. USA 93, 6264–6268.

Herbert, K.M., La Porta, A., Wong, B.J., Mooney, R.A., Neuman, K.C., Landick,

R., and Block, S.M. (2006). Sequence-resolved detection of pausing by single

RNA polymerase molecules. Cell 125, 1083–1094.

Herbert, K.M., Greenleaf, W.J., and Block, S.M. (2008). Single-molecule

studies of RNA polymerase: Motoring along. Annu. Rev. Biochem. 77,

149–176.

Hwang, H., and Myong, S. (2014). Protein induced fluorescence enhancement

(PIFE) for probing protein-nucleic acid interactions. Chem. Soc. Rev. 43,

1221–1229.

Ikeda, R.A. (1992). The efficiency of promoter clearance distinguishes T7 class

II and class III promoters. J. Biol. Chem. 267, 11322–11328.

Ikeda, R.A., and Richardson, C.C. (1986). Interactions of the RNA polymerase

of bacteriophage T7with its promoter during binding and initiation of transcrip-

tion. Proc. Natl. Acad. Sci. USA 83, 3614–3618.

Jia, Y., and Patel, S.S. (1997a). Kinetic mechanism of transcription initiation by

bacteriophage T7 RNA polymerase. Biochemistry 36, 4223–4232.

Jia, Y., and Patel, S.S. (1997b). Kinetic mechanism of GTP binding and RNA

synthesis during transcription initiation by bacteriophage T7 RNA polymerase.

J. Biol. Chem. 272, 30147–30153.

Kapanidis, A.N., Margeat, E., Ho, S.O., Kortkhonjia, E., Weiss, S., and Ebright,

R.H. (2006). Initial transcription by RNA polymerase proceeds through a DNA-

scrunching mechanism. Science 314, 1144–1147.

Kim, J.H., and Larson, R.G. (2007). Single-molecule analysis of 1D diffusion

and transcription elongation of T7 RNA polymerase along individual stretched

DNA molecules. Nucleic Acids Res. 35, 3848–3858.

Larson, M.H., Greenleaf, W.J., Landick, R., and Block, S.M. (2008). Applied

force reveals mechanistic and energetic details of transcription termination.

Cell 132, 971–982.

Liu, C., and Martin, C.T. (2002). Promoter clearance by T7 RNA polymerase.

Initial bubble collapse and transcript dissociation monitored by base analog

fluorescence. J. Biol. Chem. 277, 2725–2731.

Lopez, P.J., Guillerez, J., Sousa, R., and Dreyfus, M. (1997). The low proces-

sivity of T7 RNA polymerase over the initially transcribed sequence can limit

productive initiation in vivo. J. Mol. Biol. 269, 41–51.

Luo, G., Wang, M., Konigsberg, W.H., and Xie, X.S. (2007). Single-molecule

and ensemble fluorescence assays for a functionally important conformational

change in T7 DNA polymerase. Proc. Natl. Acad. Sci. USA 104, 12610–12615.

Ma, K., Temiakov, D., Anikin, M., and McAllister, W.T. (2005). Probing confor-

mational changes in T7 RNA polymerase during initiation and termination by

using engineered disulfide linkages. Proc. Natl. Acad. Sci. USA 102,

17612–17617.

Margeat, E., Kapanidis, A.N., Tinnefeld, P., Wang, Y., Mukhopadhyay, J.,

Ebright, R.H., and Weiss, S. (2006). Direct observation of abortive initiation

and promoter escape within single immobilized transcription complexes.

Biophys. J. 90, 1419–1431.

Martin, C.T., Muller, D.K., and Coleman, J.E. (1988). Processivity in early

stages of transcription by T7 RNA polymerase. Biochemistry 27, 3966–3974.

McAllister, W.T. (1993). Structure and function of the bacteriophage T7 RNA

polymerase (or, the virtues of simplicity). Cell. Mol. Biol. Res. 39, 385–391.

Mukherjee, S., Brieba, L.G., and Sousa, R. (2002). Structural transitions medi-

ating transcription initiation by T7 RNA polymerase. Cell 110, 81–91.

Munson, L.M., and Reznikoff, W.S. (1981). Abortive initiation and long ribonu-

cleic acid synthesis. Biochemistry 20, 2081–2085.

Molecular Cell 70, 695–706, May 17, 2018 705

Page 13: Correlating Transcription Initiation and Conformational

Myong, S., Cui, S., Cornish, P.V., Kirchhofer, A., Gack, M.U., Jung, J.U.,

Hopfner, K.P., and Ha, T. (2009). Cytosolic viral sensor RIG-I is a 50-triphos-phate-dependent translocase on double-stranded RNA. Science 323,

1070–1074.

Revyakin, A., Liu, C., Ebright, R.H., and Strick, T.R. (2006). Abortive initiation

and productive initiation by RNA polymerase involve DNA scrunching.

Science 314, 1139–1143.

Richardson, C.C. (1983). Bacteriophage T7:Minimal requirements for the repli-

cation of a duplex DNA molecule. Cell 33, 315–317.

Rosa, M.D. (1979). Four T7 RNA polymerase promoters contain an identical

23 bp sequence. Cell 16, 815–825.

Roy, R., Hohng, S., and Ha, T. (2008). A practical guide to single-molecule

FRET. Nat. Methods 5, 507–516.

Skinner, G.M., Baumann, C.G., Quinn, D.M., Molloy, J.E., and Hoggett, J.G.

(2004). Promoter binding, initiation, and elongation by bacteriophage T7

RNA polymerase. A single-molecule view of the transcription cycle. J. Biol.

Chem. 279, 3239–3244.

Sorokina, M., Koh, H.R., Patel, S.S., and Ha, T. (2009). Fluorescent lifetime tra-

jectories of a single fluorophore reveal reaction intermediates during transcrip-

tion initiation. J. Am. Chem. Soc. 131, 9630–9631.

Sousa, R. (1996). Structural and mechanistic relationships between nucleic

acid polymerases. Trends Biochem. Sci. 21, 186–190.

Sousa, R., Chung, Y.J., Rose, J.P., andWang, B.C. (1993). Crystal structure of

bacteriophage T7 RNA polymerase at 3.3 A resolution. Nature 364, 593–599.

Stano, N.M., Levin, M.K., and Patel, S.S. (2002). The +2 NTP binding drives

open complex formation in T7 RNA polymerase. J. Biol. Chem. 277,

37292–37300.

Tahirov, T.H., Temiakov, D., Anikin, M., Patlan, V., McAllister, W.T., Vassylyev,

D.G., and Yokoyama, S. (2002). Structure of a T7 RNA polymerase elongation

complex at 2.9 A resolution. Nature 420, 43–50.

706 Molecular Cell 70, 695–706, May 17, 2018

Tamarat, P., Maali, A., Lounis, B., and Orrit, M. (2000). Ten years of single-

molecule spectroscopy. J. Phys. Chem. A 104, 1–16.

Tang, G.Q., and Patel, S.S. (2006a). Rapid binding of T7 RNA polymerase is

followed by simultaneous bending and opening of the promoter DNA.

Biochemistry 45, 4947–4956.

Tang, G.Q., and Patel, S.S. (2006b). T7 RNA polymerase-induced bending of

promoter DNA is coupled to DNA opening. Biochemistry 45, 4936–4946.

Tang, G.Q., Bandwar, R.P., and Patel, S.S. (2005). Extended upstream A-T

sequence increases T7 promoter strength. J. Biol. Chem. 280, 40707–40713.

Tang, G.Q., Roy, R., Ha, T., and Patel, S.S. (2008). Transcription initiation in a

single-subunit RNA polymerase proceeds through DNA scrunching and rota-

tion of the N-terminal subdomains. Mol. Cell 30, 567–577.

Tang, G.Q., Roy, R., Bandwar, R.P., Ha, T., and Patel, S.S. (2009). Real-time

observation of the transition from transcription initiation to elongation of the

RNA polymerase. Proc. Natl. Acad. Sci. USA 106, 22175–22180.

Ujvari, A., and Martin, C.T. (2000). Evidence for DNA bending at the T7 RNA

polymerase promoter. J. Mol. Biol. 295, 1173–1184.

Villemain, J., Guajardo, R., and Sousa, R. (1997). Role of open complex insta-

bility in kinetic promoter selection by bacteriophage T7 RNA polymerase.

J. Mol. Biol. 273, 958–977.

Vo, N.V., Hsu, L.M., Kane, C.M., andChamberlin, M.J. (2003). In vitro studies of

transcript initiation by Escherichia coli RNA polymerase. 2. Formation and

characterization of two distinct classes of initial transcribing complexes.

Biochemistry 42, 3787–3797.

Yin, Y.W., and Steitz, T.A. (2002). Structural basis for the transition from

initiation to elongation transcription in T7 RNA polymerase. Science 298,

1387–1395.

Zhang, Z., Revyakin, A., Grimm, J.B., Lavis, L.D., and Tjian, R. (2014). Single-

molecule tracking of the transcription cycle by sub-second RNA detection.

eLife 3, e01775.

Page 14: Correlating Transcription Initiation and Conformational

STAR+METHODS

KEY RESOURCES TABLE

REAGENT or RESOURCE SOURCE IDENTIFIER

Bacterial and Virus Strains

BL21/pAR1219 Studier FW N/A

BL21/pAR1219 cyslight V174 This paper N/A

Biological Samples

T7 RNA polymerase This paper N/A

V174C Cyslight T7 RNA polymerase This paper N/A

Chemicals, Peptides, and Recombinant Proteins

SP Sephadex Sigma SPC25120

CM-sephadex Sigma SP

DEAE-sephacel GE healthcare GE17-0500-01

g[32P]GTP Perkin Elmer NEG004Z

40% Acrylamide/Bis solution Biorad 1610148EDU

Cy3-NHS/ Cy5-NHS GE Healthcare, Chicago, IL PA13101/ PA15101

mPEG/ biotin PEG Laysan Bio, Arab, AL MPEG-SC-5000/Biotin-PEG-SC-5000

Neutravidin Pierce, Rockford, IL 31000

Trolox Sigma-Aldrich, St. Louis, MO 238813

Glucose oxidase Sigma-Aldrich, St. Louis, MO G2133

Catalase Roche, Indianapolis, IN 10106810001

NTP Roche, Indianapolis, IN 11277057001

dNTP TriLink, San Diego, CA K1007

Oligonucleotides

NT1, NT2, NT5, T1, T2, T5, biotin18, phi10 tempate,

phi10 nontemplate (Refer to the supplementary for

the sequence information)

Integrated DNA Technologies, Coralville, IA Custom order

Software and Algorithms

Data acquisition and analysis software https://cplc.illinois.edu/software/ Home-built

CONTACT FOR REAGENT AND RESOURCE SHARING

Further information and requests for resources and reagents may be directed to and will be fulfilled by the Lead Contact, Smita S.

Patel ([email protected]) and Taekjip Ha ([email protected]).

METHOD DETAILS

All single molecule fluorescence measurements using configurations 2, 3 and 4 were performed with a total internal reflection

fluorescence microscope (TIRFM). Details of the surface passivation, optical configurations, and sample chamber assembly were

previously described (Roy et al., 2008). Imaging was performed upon excitation with 532 nm laser at 22 ± 1�C in 15 mM Tris Acetate

(pH 7.5), 10 mM Magnesium Acetate, 50 mM Sodium Acetate, 5 mM DTT, 0.1 mg/ml BSA and an oxygen scavenging system

(2 mM Trolox (Sigma-Aldrich, St. Louis, MO), 1% (w/v) dextrose, 165 U/ml glucose oxidase (Sigma-Aldrich, St. Louis, MO) and

2170 U/ml catalase (Roche, Indianapolis, IN)). Video recordings were processed to extract single molecule fluorescence intensities

at each frame, and custom written scripts were used to calculate FRET efficiencies. Data acquisition and analysis software can be

downloaded from https://cplc.illinois.edu/software/. Configuration 1 also used TIRFM except for correlative measurements of Cy3

intensity and lifetime, which used a confocal fluorescence scanning microscope equipped with single photon time-correlated single

photon counting as described previously(Sorokina et al., 2009).

Molecular Cell 70, 695–706.e1–e5, May 17, 2018 e1

Page 15: Correlating Transcription Initiation and Conformational

TIRFM and single-molecule FRET imagingWe used a continuous Nd:YAG laser (532 nm, 75mW, CrystaLaser) to excite Cy3 on the DNA or the RNAP by generating evanescent

field via total internal light reflection. Fluorescence from Cy3 and Cy5 are collected by a water-immersion objective lens (60X, N.A. =

1.2, Olympus) after passing through a 550 nm long-pass filter (E550LP, Chroma), which filters out the excitation laser beam. The

filtered fluorescence light from Cy3 and Cy5 is divided by the 630 nm dichroic mirror (645DCXR, Chroma), and the one from Cy3

is reflected, and the one from Cy5 is transmitted. The divided fluorescence is focused into and detected by an EMCCD camera

(DU-589, Andor) with a time resolution of 16-100 ms and amplified before camera readout. Video recordings were processed to

extract single molecule fluorescence intensities at each frame, and custom written scripts were used to calculate FRET efficiencies.

Data acquisition and analysis software can be downloaded from https://cplc.illinois.edu/software/. Any TIRFM instrument including

commercial instruments that can measure single molecule fluorescence in two colors simultaneously would be suitable for FRET ex-

periments we performed.

PEG-coated quartz surface was prepared to block non-specific binding of DNA and T7 RNAP. Neutravidin (Pierce, Rockford, IL)

was injected onto the PEG surface and incubated for 2 min for non-covalent bonding to the biotinylated PEG, which was 2% of the

total PEG. Subsequently, biotinylated DNA was added and incubated for 5 min for surface immobilization. For single-molecule ex-

periments, appropriate concentration of T7 RNAP and NTPs were added to the immobilized DNA in 15 mM Tris Acetate (pH 7.5),

10 mM Magnesium Acetate, 50 mM Sodium Acetate, 5 mM DTT, 0.1 mg/ml BSA and an oxygen scavenging system (2 mM Trolox

(Sigma-Aldrich, St. Louis, MO), 1% (w/v) dextrose, 165 U/ml glucose oxidase (Sigma-Aldrich, St. Louis, MO) and 2170 U/ml catalase

(Roche, Indianapolis, IN)). All the experiments were performed at room temperature (22 ± 1�C).

Single-molecule fluorescence lifetime measurementThe fluorescent lifetime decays were measured for immobilized molecules using a home-built confocal microscope. The excitation

light is divided by a beam splitter, and one is sent to a photodiode for timing detection, and the other is reflected by a dichroic mirror

and focused on the sample by an oil-immersion objective (100X, NA 1.4). The emitted light is collected by the same objective lens,

focused on a pinhole and then imaged on a Silicon avalanche photodiode (APD) (Micro Photon Devices, PD5CTC). The signals from

the APD and the photodiode are correlated in a Time Correlated Single Photon Counter (TCSPC) device (SPC 630, Becker & Hickl

GmbH) for lifetime measurement.

All molecules were excited one-by-one by a pulsed laser (Spectra-Physics, Vanguard 2000-HM, 532 nm, Pulse width 12 ps, repe-

tition rate 76-80MHz) and located using sample-scanning piezo-controlled stage (Nano-Bio2 (2-axis) nanopositioner andNano-Drive

(2-axis) control system, Mad City Labs Inc.). A 20x20 mm2 area of the sample was translated to 256x256-pixel matrix and scanned

pixel-by-pixel to determine the intensity at each pixel site. The resulting intensity map was used to determine the coordinates of

immobilized single molecules by a Gaussian fit. The DNA concentration was adjusted to provide 40-80 single molecules absorbed

to the 20x20 mm2 area.

The signals from the APD and photodiode were collected by the TCSPC device installed inside a personal computer (Windows XP),

which measured the delay time between the fluorescent photon and the laser time signal, and the global time of fluorescence photon

detection. To construct the intensity trace, we counted the number of photons collected by the APD during 10 ms time interval and

plotted them versus the global time. To extract the lifetime, we divided photons into groups of 500 contiguous photons. For each

group, we built a delay time distribution histogram, which was fitted by an exponential decay curve using Maximum Likelihood Esti-

mator algorithm. The resulting lifetime values were plotted versus global time, constituting the molecule’s lifetime trace.

DNA sequences, labeling, and annealingAll the DNAs used in this study were purchased purified from Integrated DNA Technologies (Coralville, IA). The 65-nt non-template

DNA strands (NT1, NT2, and NT5), their complementary 47-nt template DNA (T1, T2 and T5) and biotinylated 18-nt ssDNA (biotin18)

were incubated at 95�C for 5 min in the annealing buffer (10 mM Tris-HCl, pH 8, 50 mMNaCl) and cooled to room temperature slowly

to generate dsDNA.

NT1:50TGGCGACGGCAGCGAGGCTAAATTAATACGACTCACTATAGGGAGACCACAACGTTATCAGCTTC 30

NT2:50TGGCGACGGCAGCGAGGCTAAATTAATACGACTCACTATAGGGAGAGGCCATCGTTATCAGCTTC 30

NT5:50TGGCGACGGCAGCGAGGCGGAAGTAATACGACTCAGTATAGGGACAATCCATCGTTATCAGCTTC 30

T1:30ATTTAATTATGCTGAGTGATATCCCTCTGGTGTTGCAATAGTCGAAG 50

T2:30ATTTAATTATGCTGAGTGATATCCCTCTCCGGTAGCAATAGTCGAAG 50

T5:30CCTTCATTATGCTGAGTCATATCCCTGTTAGGTAGCAATAGTCGAAG 50

Biotin18: 30 biotin-ACCGCTGCCGTCGCTCCG 50

Underlined T in the non-template DNA and template DNA are an internal C6 amino-modified dT attached with Cy3 and Cy5

N-hydroxylsuccinimide (NHS) ester, respectively, and underlined italic G of the non-template DNA and C of template DNA is the tran-

scription start site.

NT1-T1 and NT2-T2 were used to obtain the FRET histograms at various transcription stall positions. With NT1-T1, we could stall

the RNAP complex at +1, +3, +4, +6, +7, +13, +14, and with NT2-T2, at +1, +3, +4, +8, +9, +11, +12. NT5-T5 had a different promoter

e2 Molecular Cell 70, 695–706.e1–e5, May 17, 2018

Page 16: Correlating Transcription Initiation and Conformational

sequence from the NT1-T1 and NT2-T2 and was used to investigate the effect of promoter sequence on abortive cycling. To stall the

RNAP transcription at a predefined position, a limiting set of NTPs (Roche, Indianapolis, IN) and 30-dNTPs (TriLink, San Diego, CA)

were utilized.

Expression, purificationm and labeling of Cys-light RNAP with V174C mutationWild-type T7 RNAP and Cys-light T7 RNAP with the V174C mutation were purified as described (Mukherjee et al., 2002). The V174C

mutant has similar activity as theWT RNAP (Figure S4A). To monitor the conformational change during the transition from initiation to

elongation, V174C labeled with Cy3 was investigated, because the labeled 174 position shows a dramatic conformational change

during the transition.

The only exposed cysteine 174 of the Cys-light T7 RNAP mutant V174C was labeled with the Cy3 fluorophore as follows: The pro-

tein was extensively dialyzed overnight at 4�C against 50 mM phosphate buffer pH 7.5, 200 mM NaCl and 50% Glycerol to remove

DTT. A 1 mM (final) tris(2-carboxyethyl)phosphine (TCEP) solution was added to protect the reduced cysteine from oxidation. The

labeling reaction was performed by adding Cy3 maleimide (GE lifesciences) that was freshly dissolved in DMSO to the protein in

a protein:dye mixing ratio of 1:1.5. The reaction was continued for 1 h at room temperature or �2 h on ice with occasional shaking.

The glycerol concentration was reduced to less than 5% before applying the mixture to a size-exclusion column to separate the free

unbound dye. For a small-scale preparation, we found that repetitive separation (usually three times) through the mini-spin column

filled with 1 mL buffered Biogel P30 (Bio-Rad) provided satisfactory separation and high recovery yield. Measurements of the protein

absorbance at 280 nm and the labeled dye at 550 nm (Cy3) confirmed that the labeling ratio was 1 to 1. Extensive dialysis against

buffer overnight appeared not to improve quality. The concentration of the labeled protein was calculated as [RNAP] (mM) =

(O.D.280CF280 3 O.D.550)/(εRNAP 3 l), where CF280 is the ratio of dye absorbance at 280 nm relative to the peak absorbance at

550 nm for Cy3 ( = 0.08). The label concentrationwas calculated asO.D.550 /εcy33 l), where l is the length of the cuvette (1 cm), εRNAP =

0.14 mM-1cm-1, and εcy3 = 0.15 mM-1cm-1. As a control, the cysteine-light RNAP without the V174C did not show dye labeling

(less than 10%).

QUANTIFICATION AND STATISTICAL ANALYSIS

Single-molecule data analysisFRET efficiency E = IA / (g ID + IA), where ID is the fluorescence from Cy3 and IA from Cy5 after appropriate removal of crosstalk and

background. Gamma factor, g, was assumed to be 1 for most FRET experiments where the quantum yield of Cy3 and Cy5 were

barely changed except the experiment in Figure 2, where the change of quantum yield of Cy3 was large enough to affect the

FRET efficiency. The FRET efficiency of DNA-only, closed, and open complex, displayed in Figure 2, were obtained by correction

with appropriate gamma factors which we calculated from the absolute gamma factor of DNA-only and the relative gamma factors

among DNA-only, closed and open complex (Figure S2A). The Dwell-time analysis was performed on the distinct intensity or FRET

states to get an average kinetic rate constant. The collected dwell-time was fit to the gamma distribution, t n-1exp(-kt), where we could

calculate the average kinetic rate constant as k/n.

Michaelis-Menten equation was used to fit the kinetic rates dependence on the concentration of RNAP and NTP. It is expressed as

v = Vmax [S] / (KM + [S]) + V0 where v is a kinetic rate of making products, Vmax, a maximum rate constant, V0, a residual rate with no

substrate, [S] is the substrate concentration, and KM, Michaelis-Menten constant.

Gamma factor correctionFRET efficiency E was calculated using E = IA / (g ID + IA), where ID is the fluorescence intensity of the donor, Cy3, and IA of the

acceptor, Cy5, after appropriate correction of background and crosstalk between the two detection channels. The gamma factor,

g, is defined as fAhA/fDhD, where fD and fA are the quantum yields of Cy3 and Cy5, respectively, and hD and hA are the detection

efficiencies of donor and acceptor channels, respectively. Detection efficiency is invariant for a given experimental setup, so the

quantum yield is the only variable term that affects the gamma factor

We checked the relative quantum yield of Cy3 and Cy5 at various transcription stall positions for the DNA construct labeled with

Cy3 at the �4 position of the non-template strand and with Cy5 at the +17 position of the template strand, respectively. The Cy3

intensities increased sequentially from its values in DNA-only to the closed complex and the open complex (Figure S1B), and did

not change appreciably from the open complex to various stalled positions until the transition to the elongation complex (Figure S5F).

The fluorescence intensity of Cy5, measured after direct excitation with 632 nm laser, did not change between the DNA-only, the

closed/open complexes, and throughout the tested transcription stalled positions (Figure S1C). Therefore, the FRET efficiency

was corrected for the RNAP binding and promoter opening steps by employing the gamma factor which takes into account the

changes in quantum yields.

Figure S1B shows the relative Cy3 intensities of DNA-only, the closed complex, and the open complex in a ratio of 1: 1.6: 2.3, which

corresponds to the relative Cy3 quantum yields of the three species. The detection efficiencies of Cy3 and Cy5 emission channels

and the quantum yield of Cy5were invariant during RNAP binding and DNA opening process. Hence, the relative gamma factor of the

three species were calculated as the inverse of Cy3 quantum yield, which resulted in a ratio of 1: 0.63: 0.43 for DNA-only, the closed

complex, and the open complex, respectively. The absolute gamma factor of DNA-only was obtained as 2.5 from single molecule

Molecular Cell 70, 695–706.e1–e5, May 17, 2018 e3

Page 17: Correlating Transcription Initiation and Conformational

time traceswhere Cy5was photobleached and usingDIA/DID upon photobleaching as ameasure of the gamma factor. Therefore, the

gamma factor of the closed complex was calculated as 2.5 3 0.63 = 1.56, and that of the open complex as 2.5 3 0.43 = 1.08.

Even though the uncorrected apparent FRET efficiency, Eapp, of DNA-only, closed and open complex appeared the same as 0.3,

the g-corrected FRET efficiency, Ecorr, were 0.15, 0.22 and 0.28, respectively.

Model of abortive initiationWe developed a phenomenological model to describe abortive initiation (Figure 7A).

The open complex O is assumed to have three possible outcomes:

1) RNAP dissociation (D)

2) RNAP recycling after abortive product release (A)

3) Transition to the elongation phase (E)

Only the second outcome A returns the complex to the initial ‘‘open state (O)’’ to continue with another round with three possible

outcomes. We call the probabilities of each outcome PE, PD and PA.

In our experimental detection of failed initiation, dissociation without initiating transcription is not detected. Therefore, the proba-

bility of failed initiation Pfail, determined experimentally to be 0.28 for promoter 1 (Figure 6F), is given by

Pfail = ðPA 3PD +PA 3PA 3PD +PA 3PA 3PA 3PD +.:Þ=ð1--PDÞ=PD 3PA=ð1� PAÞ=ð1� PDÞ (Eq. 1)

where (1-PD) is the correction factor to account for the fact that the experimental determined Pfail does not include the events where

RNAP dissociates without initiating a transcript.

Likewise, the probability of successful initiation Psucc, determined experimentally to be 0.72 for promoter 1 (Figure 6F), is given by

Psucc = ðPE +PA 3PE +PA 3PA 3PE +.Þ=ð1� PDÞ =PE=ð1� PAÞð1� PDÞ (Eq. 2)

In addition,

PD +PA +PE = 1 (Eq. 3)

For molecules that ultimately failed to transit to the elongation phase, the probability Pfail(n) that n abortive products are made before

RNAP dissociation is given by PD3 PAn/(1-PD). Therefore, PA can be approximated by Pfail(n+1)/Pfail(n) for various n values. Likewise,

using Psucc(n) versus n, we can determine PA independently.

PA = Pfail (n+1) / Pfail (n) = 0.22 for both promoters (which is calculated from Figure 6G) (Equation 3)

By putting PA = 0.22, Pfail = 0.28 and Psucc = 0.72 into Equation 1 and 2,

Pfail =PD 3 0:22=ð1� 0:22Þð1� PDÞ= 0:28

Psucc =PE=ð1� 0:22Þð1� PDÞ= 0:72

By solving the two equations above, we get PD = 0.50 and PE = 0.28 for promoter 1. Similarly, we get PD = 0.63 and PE = 0.15 for the

promoter 2.

We can do additional calculations to address some interesting feature of RNAP transcription for the consensus promoter

(promoter 1) as below:

(1) 78% of the total productive initiation events elongate without any abortion:

PE=ðPE +PA 3PE +PA 3PA 3PE +.Þ=PE=ðPE=ð1� PAÞÞ= 1� PA = 0:78

(2) The ratio of total abortive product from failed initiation to total abortive products from productive initiation is 1.8:

ðPA 3PD + 23PA 3PA 3PD + 33PA 3PA 3PA 3PD +.Þ=ð13PA 3PE + 23PA 3PA 3PE + 33PA 3PA 3PA 3PE +.Þ=�PD 3PA

.ð1� PAÞ2

�.�PE 3PA

.ð1� PAÞ2

= 1:8

e4 Molecular Cell 70, 695–706.e1–e5, May 17, 2018

Page 18: Correlating Transcription Initiation and Conformational

(3) 56% of the total initiation events elongate without any abortion:

PE=ð1� PDÞ= 0:56

(4) 0.79 is the ratio of abortive products to run-off products:

fðPA 3PD + 23PA 3PA 3PD + 33PA 3PA 3PA 3PD +.Þ+ ð13PA 3PE + 23PA 3PA 3PE + 33PA 3PA 3PA 3PE +.Þg=�ðPE +PA 3PE +PA 3PA 3PE +.Þ=

n�PD 3PA

.ð1� PAÞ2

�+�PE 3PA

.ð1� PAÞ2

�o.ðPE=ð1� PAÞÞ= 0:79

To validate our model, we calculated the ratio of abortive products to run-off using the data obtained by the ensemble gel assay

and compared it with the value calculated by our model, 0.79. The calculated value from the gel assay is 1.34. The apparent

discrepancy is likely due to the fact that the 2-mer abortive products are not detectable in the single molecule assay and the

12-mer and 13-mer abortive products (produced after the conformational change of RNAP) are regarded as EC in our single-mole-

cule assay (Figure 7B).

Therefore, the apparent ratio of abortive products to run-off products = (NAbortives(3-13-mer)-NAbortives(11-12-mer))/

(Nrun-off +NAbortives(11-12-mer)) = 0.75.

Where various abortive products and run-off products are obtained by quantifying the gel bands. Therefore, after correcting for

misclassified events, we get a good agreement between ratios obtained from single molecule analysis (0.79) and gel assay (0.75).

Molecular Cell 70, 695–706.e1–e5, May 17, 2018 e5

Page 19: Correlating Transcription Initiation and Conformational

Molecular Cell, Volume 70

Supplemental Information

Correlating Transcription Initiation

and Conformational Changes by a Single-Subunit

RNA Polymerase with Near Base-Pair Resolution

Hye Ran Koh, Rahul Roy, Maria Sorokina, Guo-Qing Tang, Divya Nandakumar, Smita S.Patel, and Taekjip Ha

Page 20: Correlating Transcription Initiation and Conformational

Supplemental Information

Correlating transcription initiation and conformational

changes by a single subunit RNA polymerase with near base

pair resolution

Hye Ran Koh1,2, Rahul Roy1, Maria Sorokina1, Guo-Qing Tang3, Divya

Nandakumar3, Smita S. Patel3* and Taekjip Ha1,4,5,6*

1 Department of Physics and Center for the Physics of Living Cells, University of

Illinois at Urbana-Champaign, Urbana, IL 61801, USA

2 Department of Chemistry, Chung-Ang University, Seoul 06974, Korea

3 Department of Biochemistry and Molecular Biology, Rutgers University, Robert

Wood Johnson Medical School, Piscataway, New Jersey 08854, USA

4 Howard Hughes Medical Institute, Baltimore, MD 21205, USA

5 Departments of Biophysics and Biophysical Chemistry, Biophysics and Biomedical

Engineering, Johns Hopkins University, MD 21205, USA

6 Lead Contact

* Correspondence: [email protected] (S.S.P.), [email protected] (T.H.)

Page 21: Correlating Transcription Initiation and Conformational

2

Supplemental Figures

Figure S1. Related to Figure 2. Transition between the DNA-only state, and the two RNAP

bound states, the closed and open complexes, and relative quantum yields of Cy3 and Cy5

during the transition and transcription initiation.

(A) Population of DNA-only, closed complex and open complex were investigated by analyzing

the average intensity histogram from ~780 molecules at 20 nM RNAP. DNA-only species

appeared as a lowest-intensity peak, closed complex as a mid-intensity peak, and open complex as

highest-intensity peak. The equilibrium constant of closed to open complex was calculated as 0.28

Page 22: Correlating Transcription Initiation and Conformational

3

by dividing the peak area of open complex by the one of closed complex, which is consistent with

a previous bulk data (B) The fluorescence intensity histogram of Cy3 conjugated to the -4 position

of non-template DNA from selected states of DNA-only, closed and open complex. The average

fluorescence intensity was calculated from the histogram, giving the intensity ratio among DNA-

only, the closed complex and the open complex as 1: 1.6: 2.3 which represents a relative quantum

yield of Cy3 among them. (C) The fluorescence intensity of Cy5 conjugated to the +17 position of

template DNA without RNAP, with RNAP but not NTPs, and with RNAP stalled at various

positions. We observed no significant change of Cy5 intensity throughout.

Page 23: Correlating Transcription Initiation and Conformational

4

Figure S2. Related to Figure 2. Gamma factor and dwell time analysis of each RNA

lengthening step during transcription initiation

(A) The gamma factor was calculated at various stall position, reporting no significant change of

gamma factor during transcription initiation from +1 to +11. It suggests that the quantum yield of

Cy3 is sustained during transcription initiation. (B) Dwell time distribution of four distinct FRET

states - 0.3, 0.43, 0.5 and 0.65 - resolved in Figure 3B. All dwell time histograms showed peaked

distributions which were fit to a gamma distribution function, tn-1e-kt, from which we calculated

the average rates, k/n, of leaving each FRET state.

Page 24: Correlating Transcription Initiation and Conformational

5

Figure S3. Related to Figure 4. Decision between two pathways of abortive cycling: RNAP

recycling vs. RNAP exchange

(A) Time traces showing two pathways of abortive cycling in the same DNA molecule where the

RNAP recycling pathway is displayed with a blue circle and the RNAP exchange pathway with a

red one. The FRET state when RNA is released without RNAP release is displayed as the pink bar.

A RNAP complex can choose one of two pathways after RNA release. (B) The frequency of RNAP

recycling and RNAP exchange pathway was counted at 10 nM RNAP and 500 μM ATP with

various GTP concentration in the range from 10 μM to 500 μM. 23 % of the RNAP were

undergoing recycling at 10 μM GTP, which increased to 93 % at 500 μM GTP. This suggests that

the RNAP recycling pathway becomes dominant at high GTP concentration. We rationalize that

at high GTP concentration, the GTP binding rate becomes faster than the RNAP release rate; hence

the RNAP recycling pathway dominates over the RNAP exchange pathway. The frequency of

RNAP recycling and RNAP exchange is equal at 26 μM of GTP.

Page 25: Correlating Transcription Initiation and Conformational

6

Figure S4. Related to Figure 5. Gel analysis of the transcription activity of the Cy3-labeled

V174C mutant of Cys-light T7 RNAP and its conformational change during the transition to

EC

(A) To test the transcription competency of the Cy3-labeled V174C mutant, 2 µM T7 RNAP (wild-

type or the labeled V174C) was mixed with 1 µM T7 φ10 promoter (from -22 to +18) following

by the addition of the NTP mixture (1 mM GTP, 100 µM GTP, CTP and UTP each) spiked with

-32P-ATP. The reaction buffer is 50 mM Tris-Cl, pH 7.6, 40 mM NaCl, 10 mM MgCl2, 10%

glycerol, and 1mM DTT. The reaction was quenched after time periods of 0, 10, 30 and 60 s by

adding 200 mM EDTA. RNA products were resolved on a 23% polyacrylamide/7M urea

Page 26: Correlating Transcription Initiation and Conformational

7

sequencing gel, The gel was exposed to a phosphor screen and scanned on Typhoon 9410

instrument (Molecular Dynamics). The Cy3-labeled V174C mutant was competent in transcription

and essentially had the same profile of transcription from initiation to elongation as that of the

unlabeled WT polymerase. (B) FRET efficiency between Cy3 conjugated to the residue 174 in the

subdomain H of T7 RNAP and Cy5 conjugated to +17 position of the template DNA strand

(configuration 3) at various transcriptional stall positions at 1 min after addition of RNAP and

NTPs. Low FRET efficiency was observed at transcriptional stalled positions +1 to +7, and high

FRET efficiency was observed at the stalled position +12. This is consistent with the translational

movement of subdomain H of T7 RNAP bring V174 closer to +17 position on the DNA during

the transition to EC. Interestingly, an intermediate FRET efficiency centered at 0.45 was observed

at the stalled positions +8 and +9, which suggested the existence of an intermediate complex on

the pathway from IC to EC. The broad distribution of FRET efficiency at stalled position +11

might be caused by fast the transitions between states or additional states. A plot of peak FRET

efficiency at various transcriptional stalled positions is displayed at the bottom. For position +11,

FRET histogram is broadly distributed without distinct peaks and therefore we omitted this

position from the plot. The plot shows the sudden FRET increase at stalled position +8 from the

transition of the IC to the intermediate complex, and another sharp FRET increase at position +12

caused by the transition to EC.

Page 27: Correlating Transcription Initiation and Conformational

8

Figure S5. Related to Figure 5. Transition to an intermediate state from the IC at stalled

positions +8 and +9 occurs while the RNAP is still in contact with the promoter

(A,B) FRET histograms of Cy5-DNA and Cy3-RNAP (configuration 3) at the transcriptional

stalled position +8 and +9 over time. Low FRET efficiency (IC) disappeared as the mid FRET

efficiency (intermediate state) centered at 0.45 grew over reaction time. (C,D) The transition rate

constant from IC to the intermediate complex was estimated to be 0.09 and 1.1 m-1 for positions

+8 and +9, respectively, by exponential fitting of the ratio of IC over the total population vs. time.

The transition rate constant at position +9 was about 10 times faster than the one at the position

+8. (E) FRET histograms of dual-labeled DNA and non-labeled RNAP (configuration 2) at the

Page 28: Correlating Transcription Initiation and Conformational

9

transcriptional stalled position +8 over time. (F) The total fluorescence intensities of DNA-only,

+3 and +8 RNAP complexes and the corresponding FRET efficiencies were obtained using

configuration 2. The plots show the total intensity vs. Eapp for each of the ~ 5000 molecules. The

total intensity of RNAP complex stalled at position +3 was double the total intensity of DNA-only.

RNAP binding to the promoter site and subsequent promoter opening step caused the quantum

yield increase of Cy3 conjugated to the promoter site. The total intensity of the newly-emerging

FRET peak in the RNAP complex stalled at position +8 (dotted green circle) matches the

intermediate state in (A) and did not decrease, demonstrating that the RNAP is still bound to the

promoter site. (G) With configuration 1, the Cy3 intensity did not increase after adding 10 nM

RNAP because the RNAP-bound state was short lived without NTPs. Fluorescence intensity

doubled as compared to DNA-only when the RNAP is stably bound to the promoter. At position

+8, the Cy3 intensity did not drop back to DNA only, suggesting no promoter clearance at stalled

position +8 in spite of the structural change observed in the RNAP. At position +12, the Cy3

intensity went back to the initial state where no RNAP is stably bound to the promoter DNA. This

suggests that the RNAP is released from the promoter site at position +12 and the transition to EC

is achieved. (H) The intermediate state during the transition from IC to EC identified in stalled

complexes at positions +8 and +9 was short-lived and not clearly distinguished in most time traces.

However, about 10 % of time traces showed a clear intermediate state as shown here.