Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
RNA Synthesis is Associated with Multiple TBP-Chromatin Binding Events Hussain A. Zaidi1, David T. Auble1, Stefan Bekiranov1
1Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA 22908 Cellular processes including transcription are inherently dynamic. Currently, the dynamics of transcription and other molecular processes in the cell are poorly understood1 because of a lack of methods that measure fundamental kinetic parameters in vivo. Precise estimation of the chromatin-binding on- and off-rates of general transcription factors (TFs) would allow stochastic modeling of pre-initiation complex formation2,3, RNA polymerase recruitment and elongation, and transcription4,5. Live-cell imaging at specific multi-copy genes is capable of yielding the residence time of TF-chromatin interactions at high temporal resolution (i.e., second timescale)6 but does not allow these measurements at single-copy genes. Cross-linking kinetic (CLK) analysis is a high spatial and temporal resolution method that enables estimation of the in-vivo TF-chromatin on- and off-rates at single-copy loci7. However, alternative approaches to estimating these kinetic parameters are needed to independently verify CLK as well as live-cell imaging approaches8. We developed and applied a physical modeling approach using chemical kinetic theory that directly estimates the residence time of TATA-binding protein (TBP) across the yeast genome from TBP competition ChIP data9—another high-spatial resolution method10,11, which was generally believed to be low temporal resolution (20 min or greater). Using this approach, we are capable of estimating TBP-chromatin residence times on the minute time-scale across the yeast genome, demonstrating that competition ChIP is actually a relatively high temporal resolution method. Comparing TBP-chromatin residence times with nascent RNA transcription rates12, we find that ~5 TBP binding events are associated with productive RNA synthesis at the typical gene. Our results paint, for the first time, a highly dynamic, stochastic picture of pre-initiation complex formation with multiple rounds of partial assembly and disassembly before productive RNA polymerase elongation. We use kinetic theory to model the in vivo competitive dynamics of the induced competitor and the endogenous TBP in competition ChIP experiment9 with TF-chromatin binding on-rate (��) and off-rate (��) (Supplemental Text Sec 1). In order to fit TBP competition ChIP two-color Agilent tiling microarray data9 to our kinetic model, we first normalize, background subtract, and scale the data for TBP peaks within gene promoters to theoretically expected values at the start of induction (� � 0) and steady state or equilibrium (� � ∞) (Supplemental Text and Extended Data Fig. 1). The kinetic theory explicitly accounts for induction of the competitor. Consequently, we fit the ratio of the induced (denoted by B) over endogenous (denoted by A) TBP concentration (���/��) determined from Western blots as a function of induction time9 to a Hill-like sigmoid function with Hill coefficient � � 4 (Fig. 1a). The fit yields a characteristic time-scale
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
for TBP induction ����� � 22 min and the steady state ratio of induced over endogenous
TBP concentration (��� � ∞/�� � 2.233). The normalized competition ChIP data is also well approximated by an � � 4 Hill-like equation with a time-scale parameter �� (Supplemental Eqn. 3), which quantifies the overall turnover response including induction and turnover dynamics at every TBP peak. We use this Hill-like equation to background subtract and scale the data to the theoretical in-vivo (denoted by superscript �) ratio of fractional occupancy of the competitor ��
� � to the endogenous ��� � TBP
which must satisfy the boundary conditions at the start of induction (��� 0 ��
� 0⁄ � 0) and steady state ( ��
� � ��� � �⁄ 2.233 as � � ∞ ) (Supplemental Eqn. 24 and
Supplemental Text). We simultaneously numerically solve and fit the in vivo chemical kinetic equations of competitive binding between species A and B (Supplemental Eqns. 1 and 2) to normalized, scaled competition ChIP data (see Extended Data Figs. 2 and Supplemental Text). We (and others9-11) ignore the impact of cross-linking theoretically as competition ChIP data was gathered at one cross-linking time (20 min of formaldehyde cross-linking) and show that the resulting off-rate, ��, could be modestly biased (Extended Data Fig. 3 a-d) using a generalization of the CLK framework to competition ChIP (Supplemental Eqns. 4-8). This framework could be used to correct the bias if data is gathered at various crosslinking times7. As noted by Lickwar et al.11, we also find that the in vivo ratio of induced over endogenous TF as a function of induction time is insensitive to the on-rate, ��, and is very sensitive to the off-rate or residence time, ��/ � ln2 /�� (Extended Data Fig. 3 e-f). Consequently, we only arrive at relatively precise values of the residence time (��/) by fitting the kinetic equations to competition ChIP ratio data. Stratifying TBP-containing promoters in 2-minute bands of ��, we show that the average normalized, scaled ratio of competitor over endogenous signals as a function of induction time progressively shows slower rise as �� increases (i.e., moves to the right) (Fig. 1b) with corresponding residence times increasing from 1.4 to 70 minutes (Fig. 1c), showing that residence times could be estimated from the ratio. Indeed, given that fitting the Hill-like equation and chemical kinetic equations should yield highly correlated results, we find a smooth relationship between ��/ and �� up to a point where numerically fitting the chemical kinetic equations becomes unstable at �� � 24.5 min (Fig. 1d and Extended Data Fig. 4). This numerical instability is due to the fact that for promoters with �� �24.5 min, the separation between the normalized, scaled data and the induction curve are well within the noise of the competition ChIP data. For �� � 24.5 min, the normalized, scaled data yield excellent fits to the chemical kinetic equations, move progressively to the right with increasing residence time and, remarkably, allow residence times as short as 1.3 minutes to be estimated (Figs. 1 e-g, Extended Data Fig. 4 e-h, and Extended Data Table 1). Importantly, the lower bound on the time resolution is determined by the noise in the induction and competition ChIP data and not the induction time of the competitor. Because a physical model of competitive binding rooted in reaction-rate theory naturally follows the profiles of the normalized, scaled data as a function of induction time, we are able to apply stringent noise criteria on the residuals of each fit (Supplemental Text) and reliably estimate TBP residence times for 794 Pol II and 205 Pol III genes (Extended
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
Data Table 1). This is in comparison to earlier estimates of relative turnover, �, for 602 Pol II and 264 Pol III genes that were obtained using linear regression to a subset of the data (i.e., 10, 20, 25 and 30 min time points)9. While � and our estimates of �� are correlated (Extended Data Fig. 5a), � is also strongly correlated with the � � 0 ratio of induced over competitor ChIP signals (Extended Data Fig. 5), which suggests insufficient background subtraction influencing the estimates of �. Nevertheless, in agreement with estimates of � made by van Werven et al.9, we find TBP residence times are notably shorter for Pol II compared to Pol III genes (Fig. 2a) and to a lesser extent for TATA compared to TATA-less genes13 (Fig. 2b). In contrast to van Werven et al.9, we find no significant differences between TBP residence times comparing SAGA containing and SAGA free genes (Extended Data Fig. 6d) or TFIID containing and TFIID free genes (Extended Data Fig. 6g). Given that Pol III genes tend to be higher expressed14 and have longer TBP residence times than Pol II genes, we were surprised to find marginally shorter TBP residence times at highly expressed ribosomal protein (RP) genes (Extended Data Fig. 6j) compared to other genes (Fig. 2c). This finding was consistent with higher nascent RNA transcription rates (TRs)12 for shorter TBP residence times at Pol II genes (Fig. 2d). Shorter residence times were also associated with higher levels of transcriptional extrinsic noise15 (Fig. 2e) consistent with recent findings16. With estimates of TR and TBP ��/, we defined transcriptional efficiency, which is the product of the transcription rate and TBP residence time (����/) and represents the number of TBP residence times or binding events associated with productive elongation of Pol II and transcription. Strikingly, we find low transcriptional efficiencies for Pol II genes (Fig. 2f). The median ����/ across Pol II promoters is ~5 TBP binding events for productive RNA synthesis to proceed (Fig. 2f). These findings are consistent with a stable scaffold model of PIC formation17 and function for Pol III genes, and rapid, highly stochastic TPB/PIC dynamics at higher expressed Pol II genes. To gain further insights into the upstream regulation and/or downstream impact of TPB-chromatin binding dynamics especially on regulation of gene expression, we compared TBP residence times to previously derived Rap111 and nucleosome10 relative turnover rates (�) and their relative residence times (���). Notably, we show that � equals �� plus a time-dependent function (Supplemental Eqn. 28, Extended Data Fig. 7) and can be excessively biased if normalized ChIP ratios are not properly scaled to ratios of fractional occupancies before model fitting11 (Supplemental text and Extended Data Fig. 8). Nevertheless, we find TBP residence time is correlated with Rap1 ��� at non-RP Pol II genes but not at RP Pol II genes (Fig. 3a). TBP ��/ (corr � �0.11; Extended Data Fig. 9a) and Rap1 ��� (Fig. 3b) show weak negative and positive correlation with transcription rate, respectively, while transcriptional efficiency is modestly correlated with Rap1 relative residence time at non-RP Pol II genes (Fig. 3c). Interestingly, the majority of sites, for which Rap1 relative residence times were determined (ranging from 30-150 min), contain highly dynamic TBP (��/ � 1.4 min or �� � 24.5 min; Extended Data Fig. 9b), further supporting our findings that Rap1 relative residence times11 are 20 to 30 fold higher than, but likely correlated with, Rap1 ��/ . While +1 nucleosome dynamics are poorly correlated with TBP residence time (Figs. 3d, Extended Data Figs. 9 c,d), they are positively correlated with transcription rate (Figs. 3e, Extended Data Fig.
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
9e) and efficiency (Figs. 3f, Extended Data Fig. 9f). These results suggest that while the dynamics and not merely the presence (Extended Data Fig. 6m) of transcription factors like Rap1 potentially regulate TBP/PIC dynamics, TBP and Rap1 recruitment and dynamics are not the rate-limiting step in transcription at Pol II genes. Conversely, the dynamics of factor that regulate elongation including +1 nucleosome turnover may play a more critical role in determining transcription rate and efficiency. To further assess the hypothesis that transcription factors associated with elongation as opposed to PIC and Pol II recruitment or initiation are the rate limiting step in transcription, we tested the effect that the presence or absence of 202 transcription factors mapped to the yeast genome18 had on TBP residence time, transcription rate and transcription efficiency. We subdivided loci for which we had estimates of TBP residence time into quartiles of the number transcription, initiation and elongation factors, respectively. As expected, the presence of greater numbers of transcription, initiation and elongation factors at promoters had no significant impact on TBP residence times (Figs. 4a-c) but yielded higher transcription rates (Figs. 4d-f) and efficiencies (Fig. 4g-i). Strikingly, the presence of more elongation factors had a much greater impact on both transcription rate (Fig. 4f) and efficiency (Fig. 4i) compared to that of initiation factors (Figs. 4e,h), consistent with our hypothesis. For each of the 202 factors, we also assessed the significance of differences of TBP residence times, transcription rate and efficiency at sites with the factor present compared to sites with that factor absent. We only found one factor, Tfa2 (a TFIIE subunit), whose presence or absence yielded significantly different residence times (Fig. 5a). However, Tfa2 had a more modest effect on transcription rate (Fig. 5b) and no significant effect on efficiency (Fig. 5c). In contrast, we find that 46% and 50% of all the initiation and elongation factors mapped, respectively, significantly modulate transcription rate and efficiency (Fig. 5d,e). Not surprisingly, many of these factors are members of initiation and elongation complexes whose enrichment at promoters leads to both increased transcription rate and efficiency (Fig. 5f). We find that the dynamics as opposed to the presence of factors like Rap1 affect TBP residence time, suggesting a number of unknown dynamic relationships and regulatory roles for transcription factors that await characterization and testing using methods that measure dynamic parameters including residence time and transcriptional efficiency. These approaches and measurements should ultimately allow the stochastic processes of pre-initiation complex formation, Pol II recruitment and elongation and transcription to be characterized quantitatively. Supplementary Information is linked to the online version of the paper at www.nature.com/nature. Acknowledgements We would like to thank members of the Bekiranov lab, particularly Brian Capaldo, and the Auble lab, especially Elizabeth Hoffman, for helpful suggestions. This research was supported by NIH grant R21 GM110380 (awarded to S.B and D.T.A) and NIH grant R01 GM55763 (awarded to D.T.A).
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
Author Contributions S.B. conceived the approach and guided method development and data analysis. H.A.Z. developed and applied the methods and performed the data analysis. D.T.A. made critical analysis suggestions. H.A.Z., D.T.A. and S.B. interpreted the results and wrote the paper. Author Information The authors declare no competing financial interests. Correspondence and requests for materials, software and information should be addressed to S.B. ([email protected]) or H.A.Z ([email protected]). Methods Background subtraction, normalization and scaling of competition ChIP data. The raw data generated by van Werven et al.9 (ArrayExpress E-M-TAB-58) reported the optical signal intensity for induced (SHA) and endogenous (SAvi) protein concentrations hybridized on an Agilent whole-genome microarray. SHA and SAvi were replicated by swapping Cy3 and Cy5 dyes to take into account dye-specific variations in the intensity of the optical signal. In our analysis, we used the geometric average of the two dye-swapped ratios, �� , as described in Supplemental Text. Non-specific background probes were identified by fitting a normal curve to the right edge of the � � 0 minute log2(SHA/SAvi) data as shown in Extended Data Figure 1. For our analysis, we selected signal probes in the tail of the normal fit to the non-specific background with a false discovery rate (FDR) of 0.05 or less in the � � 0 minute data. �� values were normalized across time points, �, by dividing them by the background mean obtained from the normal fit to the background probes (Extended Data Figure 1). To quantify the induction of HA over time, we fit a Hill-like sigmoid curve with � � 4 to the ratio of the concentration of HA over Avi
� �� ���
��
, where A and B denote Avi and HA, respectively. The fit gives an induction time
(����� ) of 22 minutes and the saturation value of HA/Avi concentration ratio of 2.23
(Supplemental Eqn. 3, and Fig. 1a). We relate the empirical values of normalized �� for the signal probes in our data to the ratio of the in-vivo fractional occupancy of HA (��
� )
and Avi (��� ) as !�� � " � ��
�
��� , where " is the locus-specific differential background
between HA and Avi at � � 0 minutes, and ! denotes a scale factor which effectively quantifies the ratio of the antibody affinities for HA and Avi (Supplemental Text Section 2). To determine ! and " at every TBP peak, a Hill-like sigmoid curve (with � � 4) with the added term " was fit to the normalized �� (Supplemental Eqn. 24). " was subtracted from �� and ! was determined as the ratio of the asymptotic in-vivo concentration ratio of HA/Avi (2.23) over the asymptotic �� value. Hence, after scaling and background
subtraction, !�� � " satisfied the two boundary conditions: ���
���
� 0 for � � 0 , and
���
���
� 2.23 as � � ∞, as required by the kinetic model of in-vivo competitive binding.
Estimation of residence time by fitting chemical kinetic theory based model of competitive binding to normalized, scaled competition ChIP data. The model for in-vivo competitive binding dynamics between endogenous Avi (subscript A) and
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
competitor HA (subscript B) TBP is described by mass-action differential equations linear in the TBP-chromatin association rate (��
� ) and dissociation rate (��� ):
����
��� ���
� ��� # ��
� �� ���
��
$1 � ��� � ��
� % (1)
����
��� ���
� ��� # ��
� $1 � ��� � ��
� % (2)
We have assumed that the association and dissociation rates for endogenous and competitor TBP are the same, superscript i denotes “in-vivo”, and we have absorbed the experimentally undetermined endogenous concentration (��
� into ��� , such that ��
� and ��
� have units of inverse minutes (Supplemental Text Section 2). Equations (1) and (2) cannot be solved analytically due to the time dependence of ��
� , but a solution can be
derived for the case when the induction of HA is instantaneous: �� ���
��
� 0 for � � 0 and
�� ���
��
� &'�(�)�� for � * 0 . Inserting the actual time dependent �� ���
��
in the ideal
induction solution gives an approximate solution to the Equations (1) and (2) (Supplemental Eqns. 19 and 20, Extended Data Fig. 2). We fit the approximate solution of ideal induction to the normalized, scaled ratio data using NonlinearModelFit in Mathematica (Supplemental Text Section 5.1). The ratio data is almost insensitive to ��
� , and hence, we can reliably extract only ��� . The approximate
solution introduces a bias in our estimate of ��� , which we fix using a pre-generated look-
up table (Supplemental Text Section 5.2). Finally, we use our bias-corrected estimates as the starting point for a numerical one-dimensional Newton’s method fit of Equations (1) and (2) to find the minimum of the fit residual and extract ��
� (Supplemental Text Section 5.3). To calculate the derivative of the fit residual required at each iteration of Newton’s method, we numerically solve the in-vivo differential equations using NDSolve in Mathematica. Exceptions to the fitting procedure where we had to change the starting estimate of ��
� or the step size for Newton’s method are noted in Supplemental Text Section 5.
Statistical analyses of residence time, transcription rate and transcription efficiency data. Throughout the main text and the supplement, quoted correlations are Spearman correlation coefficients unless otherwise stated. Kolmogorov-Smirnoff (KS) test was conducted in R using the ks.test function to determine the p-values reported in Figs. 2, 3, and 4 of the main text. For Fig. 5a of the main text, permutation test (which is useful in particular when the test statistic does not follow a normal distribution) was used to calculate the false discovery rate (FDR) for ��/, ��, and ����/ . In other words, loci across the genome were partitioned into two sets for each transcription factor: those that showed a significant enrichment of the transcription factor above the background as determined by Venters et al.18, and those that did not. These two sets were used to conduct permutation test for ��/, ��, and ����/ using permTS in the perm library in R, which gives the mean difference between the two sets along with the p-value for the mean difference. The p-value was adjusted using the Benjamini-Hochberg correction19 using the p.adjust function in R to derive FDR estimates. In Fig. 5d the FDR for ����/
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
was plotted against the FDR for ��, and transcription factors were listed in descending order of ����/ mean differences. The blue dots (representing TFs that affect TR more significantly than ����/) were chosen with a TR FDR < 0.06 and ����/ FDR > 0.1. Red dots (representing TFs that are significant in permutation tests for both TR and ����/) were chosen with TR FDR < 0.1 and ����/ FDR < 0.1. Finally, black dots (representing TFs that potentially affect ����/ more than TR) were chosen with TR FDR > 0.1 and ����/ FDR < 0.1, or TR FDR > 0.45 and ����/ FDR < 0.3. Figure Legends Figure 1 | Estimation of TBP residence time from model fit of normalized, scaled competition ChIP data. a, Ratio of concentration of competitor TBP (denoted by B) to the concentration of endogenous TBP (denoted by A) as a function of induction time measured by Western blotting9 along with a sigmoid fit to the data. The fit gives an induction time ��
��� of 22 minutes and a saturation concentration ratio of 2.233 as the induction time goes to infinity. b, Plot of normalized, scaled competition ChIP ratio data (competitor/endogenous) versus induction time, stratified and averaged in bands of 2 minutes for �� , an estimate of the overall turnover response time, ranging from 24.5 minutes to greater than 40 minutes, along with induction data (dashed). The average data shows a progressively slower rise as �� increases. c, Normalized density of residence times, ��/ , obtained from data within each �� band (same color scheme as panel (b)) showing that larger �� leads to longer residence times. d, log2-log2 plot of ��/ versus �� showing a monotonic relationship between ��/ and �� for �� � 24.5 min. For loci with �� � 24.5 min, the noise in the data and the induction curve makes ��/ estimates imprecise. As a consequence, estimates of residence times shorter than ~ 90 seconds are unreliable. e – g, Representative fits of our chemical kinetic theory based model to the normalized, scaled competition ChIP ratio data and estimates of ��/, along with the Hill-like sigmoid function fit to the induction data (dashed) with ��
��� � 22 min. The colors of the data and the fits correspond to the appropriate �� band shown in (b). Figure 2 | Multiple, minute-scale TBP-chromatin binding events required for transcription at high expressed Pol II genes. a, Normalized density of ��/ for Pol II and Pol II genes showing that median Pol II residence time is 3 minutes, while the median for Pol III genes is 9 minutes. The difference between the two distributions is significant with a Kolmogorov-Smirnoff (KS) p-value = 2.2e-16. b, Normalized ��/ density for TATA-containing versus TATA-less promoters. TATA-containing promoters have over all shorter residence times than TATA-less promoters (KS p-value = 0.0075). c, RP genes have marginally shorter residence times compared to non-RP genes (median RP ��/ � 1.4 min and median non-RP ��/ � 1.6 min; KS p-value=0.25). d, Promoters in the highest quartile of TR tend to have shorter ��/ than promoters in the lowest quartile (KS p-value = 0.005). e, Promoters with higher extrinsic transcriptional noise have lower residence time (KS p-value = 0.048). f, Normalized density of transcription efficiency (defined as the transcription rate multiplied by residence time, ����/) on the
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
log2 scale showing that for a typical Pol II promoter, ~5 TBP turnovers are required before a single molecule of RNA is successfully transcribed. Figure 3 | TBP dynamics are correlated with Rap1 but not +1 nucleosome dynamics. a-c, log2-log2 scatterplot of Rap1 relative residence time (���) versus ��/ (a), �� (b), and ����/ (c) for non-RP genes (red) and RP genes (blue). Rap1 ��� is correlated well with ��/ and ����/ at non-RP genes, but not at RP genes. ��� is mildly correlated with ��, while ��/ is mildly anti-correlated with ��. d-f, Normalized ��/ (d), �� (e) and ����/ (f) densities for genes containing hot and cold +1 nucleosomes. Hot nucleosomes were in the top quartile of nucleosome turnover and cold were in the bottom quartile. There was no difference in the residence times between hot and cold nucleosomes (KS p-value=0.50) (d), but hot nucleosomes tended to have higher �� (KS p-value=0.007) (e) and higher ����/ (KS p-value=1.3e-7) (f). Figure 4 | High numbers of elongation factors at Pol II promoters are associated with higher transcription rates and efficiencies. a-c, Normalized density of ��/ on log2 scale for genes with the upper quartile numbers of bound TFs and genes with the lower quartile numbers of bound TFs (out of 202 mapped TFs18) showing that ��/ is not modulated by the number of overall TFs (a), initiation TFs (b) or elongation TFs (c). d-f, Normalized density of �� on the log2 scale for genes with the upper quartile numbers of bound TFs and genes with the lower quartile numbers of bound TFs showing that �� is modulated by the number of overall TFs (KS p-value=8.6e-5) (d), initiation TFs (KS p-value=9.8e-4) (e) and elongation TFs (KS p-value=3.14e-11) (f). g-i, Normalized density of ����/ on log2 scale for genes with the upper quartile numbers of bound TFs and genes with the lower quartile numbers of bound TFs showing that ����/ is significantly modulated by the number of overall TFs (KS p-value=2.4e-4) (g), initiation TFs (KS p-value=0.05) (h) and elongation TFs (KS p-value=8.5e-8) (i). Figure 5 | Presence of TFIIE results in lower TBP residence times. a-c, The presence of Tfa2 yielded lower ��/ (KS p-value=5.4e-3) (a) and �� (KS p-value=0.028) (b), but ����/ was not affected by Tfa2 (c). d, Significance of the increase or decrease that the presence versus absence of each TF (each dot) of 202 mapped TFs18 had on ����/ (y-axis) and/or �� (x-axis) quantified by performing a permutation t-test and applying a false discovery rate (FDR) correction. Low FDRs were used to identify TFs that are likely to affect the transcription rate only (blue dots), the transcription efficiency only (black dots) or both (red dots). e, Ranked list of TFs (categorized according to access, orchestration, initiation and elongation18) whose presence compared to absence at genes yielded the most significant higher (“Fastest” in table) or lower (“Slowest” in table) �� (blue), ����/ (black), or both �� and ����/ (red). f, Select multi-protein complexes from the list in (e) highlighting the role of each complex in increasing (“Fast Subunits” in table) or decreasing (“Slow Subunits” in table) �� and/or ����/ using the same color scheme as (d) and (e).
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
References 1 Hager, G. L., McNally, J. G. & Mistelli, T. Transcription Dynamics. Mol. Cell 35,
741-753 (2009).
2 Boettiger, A. N., Ralph, P. L. & Evans, S. N. Transcriptional regulation: effects
of promoter proximal pausing on speed, synchrony and reliability. PLoS
Comput Biol 7, e1001136, doi:10.1371/journal.pcbi.1001136 (2011).
3 Boettiger, A. N. Analytic approaches to stochastic gene expression in
multicellular systems. Biophys J 105, 2629-2640,
doi:10.1016/j.bpj.2013.10.033 (2013).
4 Larson, D. R., Zenklusen, D., Wu, B., Chao, J. A. & Singer, R. H. Real-Time
Observation of Transcription Initiation and Elongation on an Endogenous
Yeast Gene. Science 332, 475-478 (2011).
5 Suter, D. M. et al. Mammalian Genes Are Transcribed with Widely Different
Bursting Kinetics. Science 332, 472-474 (2011).
6 Karpova, T. S. et al. Concurrent Fast and Slow Cycling of a Transcriptional
Activator at an Endogenous Promoter. Science 319, 466-469 (2008).
7 Poorey, K. et al. Measuring chromatin interaction dynamics on the second
time scale at single-copy genes. Science 342, 369-372,
doi:10.1126/science.1242369 (2013).
8 Sprouse, R. O. et al. Regulation of TATA binding protein dynamics in living
yeast cells. PNAS 105, 13304-13308 (2008).
9 van Werven, F. J., van Teeffelen, H. A., Holstege, F. C. & Timmers, H. T. Distinct
promoter dynamics of the basal transcription factor TBP across the yeast
genome. Nat Struct Mol Biol 16, 1043-1048, doi:10.1038/nsmb.1674 (2009).
10 Dion, M. F. et al. Dynamics of replication-independent histone turnover in
budding yeast. Science 315, 1405-1408, doi:10.1126/science.1134053
(2007).
11 Lickwar, C. R., Mueller, F., Hanlon, S. E., McNally, J. G. & Lieb, J. D. Genome-
wide protein-DNA binding dynamics suggest a molecular clutch for
transcription factor function. Nature 484, 251-255,
doi:10.1038/nature10985 (2012).
12 Pelechano, V., Chavez, S. & Perez-Ortin, J. E. A complete set of nascent
transcription rates for yeast genes. PLoS One 5, e15442,
doi:10.1371/journal.pone.0015442 (2010).
13 Basehoar, A. D., Zanton, S. J. & Pugh, B. F. Identification and distinct
regulation of yeast TATA box-containing genes. Cell 116, 699-709 (2004).
14 Moqtaderi, Z. & Struhl, K. Genome-wide occupancy profile of the RNA
polymerase III machinery in Saccharomyces cerevisiae reveals loci with
incomplete transcription complexes. Mol Cell Biol 24, 4118-4127 (2004).
15 Stewart-Ornstein, J., Weissman, J. S. & El-Samad, H. Cellular noise regulons
underlie fluctuations in Saccharomyces cerevisiae. Mol Cell 45, 483-493,
doi:10.1016/j.molcel.2011.11.035 (2012).
16 Ravarani, C. N., Chalancon, G., Breker, M., de Groot, N. S. & Babu, M. M. Affinity
and competition for TBP are molecular determinants of gene expression
noise. Nat Commun 7, 10417, doi:10.1038/ncomms10417 (2016).
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
17 Yudkovsky, N., Ranish, J. A. & Hahn, S. A transcription reinitiation
intermediate that is stabilized by activator. Nature 408, 225-229,
doi:10.1038/35041603 (2000).
18 Venters, B. J. et al. A comprehensive genomic binding map of gene and
chromatin regulatory proteins in Saccharomyces. Mol Cell 41, 480-492,
doi:10.1016/j.molcel.2011.01.015 (2011).
19 Reiner, A., Yekutieli, D. & Benjamini, Y. Identifying differentially expressed
genes using false discovery rate controlling procedures. Bioinformatics 19,
368-375 (2003).
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
3.5 4.0 4.5 5.0 5.5 6.0
ï�ï�
04
log2(t0[min])log 2(t 1
2[min])
Ê Ê
ÊÊ
Ê
ÊÊ
Ê
Fit qBiêqAiCBêCA
t1ê2 = 1.4 min
0 20 40 60 80 1000.0
0.5
1.0
1.5
2.0
Time HminutesL
Ratio
Ê ÊÊÊ
Ê
Ê
Ê
Ê
Fit qBiêqAiCBêCA
t1ê2 = 13.86 min
0 20 40 60 80 1000.0
0.5
1.0
1.5
2.0
Time HminutesL
Ratio
Ê
Ê
Ê ÊÊ
Ê
Ê
Ê
Fit qBiêqAiCBêCA
t1ê2 = 69.3 min
0 20 40 60 80 1000.0
0.5
1.0
1.5
2.0
Time HminutesLRatio
Fig.%1%b%a%
e%
d%c%
f%
Ê Ê
Ê
Ê Ê
Ê
Ê Ê
0 20 40 60 800.0
0.5
1.0
1.5
2.0
2.5
Time Hin minutesL
C BêC A
0.0
0.5
1.0
1.5
2.0
2.5
0 25 50 75Time (minutes)
Nor
mal
ized
Sca
led
Sign
al
t0> 40.5 min��������PLQ��������PLQ��������PLQ��������PLQ��������PLQ��������PLQ��������PLQ��������PLQ
Induction curve
0 1 2 3 4 5
01
23
45
6
log2(t1 2 [mins])
Nor
mal
ized
Den
sity
g%
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
0 1 2 3 4 5
0.0
0.4
0.8
log2(t1 2[min])
Nor
mal
ized
dens
ity Cvext>0.123Cvext<0.087
0 1 2 3 4 5
0.0
0.4
0.8
log2(t1 2[mins])
Nor
mal
ized
dens
ity Pol II Pol III
0 1 2 3 4 5
0.0
0.2
0.4
0.6
log2(t1 2[mins])
Nor
mal
ized
dens
ity High TRLow TR
0 1 2 3 4 50.
00.
20.
40.
6log2(t1 2[mins])
Nor
mal
ized
dens
ity TATATATA−less
0 1 2 3 4 5
0.0
0.2
0.4
0.6
log2(t1 2[mins])
Norm
alize
d de
nsity RP genes
Other genes
Fig.%2%
d%
c%b%a%
f%e%
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
0 1 2 3 4 5
0.0
0.2
0.4
0.6
log2(t1 2[mins])
Nor
mal
ized
dens
ity Hot +1 nucCold +1 nuc
−6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR [molecules/min])
Nor
mal
ized
dens
ity Hot +1 nucCold +1 nuc
−6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR t1 2[molecules])
Nor
mal
ized
dens
ity Hot +1 nucCold +1 nuc
Fig.%3%
b%a%
e%d%
c%
f%
1 2 3 4
5.0
5.5
6.0
6.5
7.0
log2(t1 2[min])
log 2
(
[min
])
corr = -0.09corr = +0.47
λ-1
ï� ï� ï� 0 �
5.0
5.5
���
���
7.0
log�(TR [molecules/min])
corr = -0.06corr = +0.11
log �
(
[min
])λ-1
ï� ï� ï� 0 �
5.0
5.5
���
���
7.0
log�(TR t1 � ([molecules])
corr = -0.002corr = +0.33
log �
(
[min
])λ-1
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
0 1 2 3 4 5
0.0
0.2
0.4
0.6
0.8
log2(t1 2[mins])
Nor
mal
ized
dens
ity > 12 Elongation TFs< 4 Elongation TFs
0 1 2 3 4 5
0.0
0.2
0.4
0.6
0.8
log2(t1 2[mins])
Norm
alize
d de
nsity > 19 Initiation TFs
< 12 Initiation TFs
−8 −6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR [molecules/min])
Norm
alize
d de
nsity > 19 Initiation TFs
< 12 Initiation TFs
0 1 2 3 4 5
0.0
0.2
0.4
0.6
0.8
log2(t1 2[mins])
Norm
alize
d de
nsity > 74 TFs
< 40 TFs
−8 −6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR [molecules/min])
Norm
alize
d de
nsity > 74 TFs
< 40 TFs
−6 −4 −2 0 2 40.00
0.15
0.30
log2(TR t1 2 [molecules])
Nor
mal
ized
den
sity > 74 TFs
< 40 TFs
Fig.%4%
d%
a%
e%
b%
g%
c%
−8 −6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR [molecules/min])
Nor
mal
ized
dens
ity > 12 Elongation TFs< 4 Elongation TFs
f%
−6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR t1 2 [molecules])
Nor
mal
ized
den
sity > 12 Elongation TFs
< 4 Elongation TFs
i%
−6 −4 −2 0 2 40.00
0.15
0.30
log2(TR t1 2 [molecules])
Nor
mal
ized
den
sity > 19 Initiation TFs
< 12 Initiation TFs
h%
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint
0 1 2 3 4 5
0.0
0.4
0.8
log2(t1 2[mins])
Nor
mal
ized
dens
ity Tfa2No Tfa2
−8 −6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR [molecules/min])
Nor
mal
ized
dens
ity Tfa2No Tfa2
−8 −6 −4 −2 0 2 4
0.0
0.2
0.4
log2(TR t1 2 [molecules])
Norm
alize
d de
nsity Tfa2
No Tfa2
Fig.%5%
a%
d%
c%b%
e% f%Fast Subunits
Esa1, Epl1, Eaf3Rlr1, Thp1
Spt7, Chd1, Hfi1, Spt3
Bur6, Ncb2Htb1
Tfa1, Tfa2Ssl2, Kin28, Tfb1,
Tfb3Med4, Srb4Spt16, Pob3
Spt6, Iws1Rpb2, Rpo21, Rpb3Taf14, Taf3, Taf13,
Taf4Set1, Bre2
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
TR FDR
TR t 1
2 FD
R
Slow subunits
Htz1
Rgr1
Swc1, Vps72
Complex
nuA4Tho
SAGA/SLIK
NC2Nucleosome
TFIIETFIIH
MediatorFACTSwrCSpt6PolIITFIID
Compass
Fastest:Spt7, Bur6Tfa1, Ssl2
Kin28, Taf14Med4, Tfa2Tbp, Ncb2Hfi1, Tfb1Taf3, Tfb3
Taf13, Tfg1Spt3, Tfc6
Taf4Srb4
Slowest:Swc1Htz1
Vps72Ioc3Irc20
Slowest:Xpb1Fkh2
Fastest:Chd1Isw1Ycs4 Esa1
Nhp6aEpl1Sir2Spt2Bre1Set1Tup1Ahc1Bre2 Eaf3Htb1
Slowest:Rgr1
Fastest:Cin5Pho2Ume6Sfp1 Ifh1
Rap1 Dig1
Spt23 Msn2
Skn7
InitiationOrchestrationAccess Elongation
Fastest:Dst1Paf1Ctr9
Spt16, Rpo21 Spt6, Rpb3 Rpb2, Iws1
Rpb7Ess1Pob3
Rtt103 Pcf11Rlr1
Rna14Thp1
.CC-BY 4.0 International licensecertified by peer review) is the author/funder. It is made available under aThe copyright holder for this preprint (which was notthis version posted March 22, 2016. . https://doi.org/10.1101/045161doi: bioRxiv preprint