Upload
ivaylo-nankov-naydenov
View
221
Download
0
Embed Size (px)
Citation preview
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 1/12
The regulation of bacterial transcription has been a topicof interest for several decades; the transcriptional regu-latory circuit of the lac operon was described as early as1959 (REF. 1). Bacterial genes can be arranged in operons,which are groups of contiguous genes regulated by a com-mon operator2. Bacterial transcription is carried out by a single RNA polymerase (RNAP) holoenzyme complexthat consists of a core enzymatic machine and a σ-factor.This RNAP holoenzyme binds to different σ-factors thatrecognize different promoters and so control specific setsof genes3. Different bacteria have different numbers of σ-factors, although most have at least a member of thehousekeeping σ-factor family (σ70)4. Transcription fac-tors are additional regulatory factors, although they arenot necessarily part of the holoenzyme. Many transcrip-
tion factors can either promote or repress transcription,depending on the promoter. Escherichia coli encodes 314transcription factors, of which 35% are activators, 43%are repressors and 22% are dual regulators5. Transcriptionis terminated through two different mechanisms: Rho-dependent and Rho-independent termination6. Finally, theinitiation of protein translation requires (in most cases) aShine–Dalgarno motif, which is a short sequence close tothe start codon that recruits the ribosome to the mRNA7.
In addition to this basic regulatory machinery, vari-ous other proteins and regulatory elements increasethe complexity of the events leading from DNA to pro-tein: RNAP-associated proteins affect the processivity
of RNAP8; internal promoters within operons9, smallRNAs (sRNAs)10 and riboswitches (RNAs that regulatetheir own gene activity)11 affect transcription and transla-tion; additional transcription termination regulates thetermination process6; and non-canonical ribosome bind-ing motifs12 and leaderless mRNAs that are translated13 affect protein translation. However, many of the exam-ples of these processes have been considered as ‘oddities’and, until recently, the general view outside the field of microbiology was that bacterial transcription is simpleand well understood.
Over the past 10 years, regulation of gene expres-sion in bacteria has come back into the spotlight, withmany discoveries being made owing to the combinationof classic genetics and biochemical assays with high-
throughput technologies. The large amount of data col-lected has revealed that many of the ‘oddities’ may insteadbe the rule. For instance, sRNAs account in some casesfor 10–20% of the bacterial RNA products and may havean important regulatory role14, and riboswitches affectgene expression upon metabolite binding15. The new evi-dence suggests that the definition of the operon shouldbe redefined, although the new information makes that achallenge. For example, 20% of all Bacillus subtilis genesin polycistronic operons are transcribed from more thanone promoter16. Similarly, almost 6% of the polycistronicoperons contain an internal read-through terminator, atwhich partial continuation of the transcription occurs17.
*Centre for Genomic
Regulation, Universitat
Pompeu Fabra, Av. Dr. Aiguader 88, 08003
Barcelona, Spain.‡ICREA (Institució Catalana de
Recerca i Estudis Avançats),
Passeig Lluís Companys, 23,
08010 Barcelona, Spain.§Present address: Harvard
Medical School, 77 Louis
Pasteur Avenue, Boston,
Massachusetts 02115, USA.||These authors contributed
equally to this work.
Correspondence to L.S.
e-mail: [email protected]
doi:10.1038/nrmicro2620
Bacterial transcriptomics: what isbeyond the RNA horiz-ome?Marc Güell*§ ||, Eva Yus‡ ||, Maria Lluch-Senar* and Luis Serrano*‡
Abstract | Over the past 3 years, bacterial transcriptomics has undergone a massive revolution.
Increased sequencing capacity and novel tools have made it possible to explore the bacterial
transcriptome to an unprecedented depth, which has revealed that the transcriptome is more
complex and dynamic than expected. Alternative transcripts within operons challenge the
classic operon definition, and many small RNAs involved in the regulation of transcription,translation and pathogenesis have been discovered. Furthermore, mRNAs may localize to
specific areas in the cell, and the spatial organization and dynamics of the chromosome have
been shown to be important for transcription. Epigenetic modifications of DNA also affect
transcription, and RNA processing affects translation. Therefore, transcription in bacteria
resembles that in eukaryotes in terms of complexity more closely than was previously thought.
Here we will discuss the contribution of ‘omics’ approaches to these discoveries as well as the
possible impact that they are expected to have in the future.
R E V I E W S
658 | SEPTEMBER 2011 | VOLUME 9 www.nature.com/reviews/micro
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 2/12
Transcriptome
The complete set of RNA
molecules produced in a cell.
DNA microarrays
Technology used to carry out
measurements of a large
number of transcript levels
simultaneously. Microarrays
consist of a series of
microscopic spots of DNA
oligonucleotides targeting
specific sequences. Theseprobes hybridize to the target
species (usually cDNA).
Probe–target hybridization is
quantified to determine the
abundance of nucleic acid
sequences in the sample.
Tiling arrays
A subtype of DNA microarray
chips. Tiling arrays differ
according to the nature of the
probes. Probe sequences are
tiled and cover the entire
genome. They are used for
whole-transcriptome profiling.
All of these discoveries have benefited from the tech-nical revolution of the so-called ‘omics’ disciplines. Geneexpression studies have clearly been at the forefront of this revolution, and modern techniques have provideda high-resolution view of various aspects of transcriptome organization. What is now on the horizon? How canwe integrate the newly acquired knowledge? In thisReview, we summarize the quantum leap that bacterialtranscriptomics has taken over the past several years andsuggest areas where the new technologies could have amajor impact.
A technical revolution
Recent years have witnessed a revolution in the fieldof bacterial transcriptomics (BOX 1; FIG. 1a). PioneeringRNomics studies used cDNA synthesis from RNA sam-ples, cloning and Sanger sequencing 18, but the transcrip-
tomics revolution started when the development of DNAmicroarrays provided a tool to globally quantify geneexpression19. Several years later, technological break-throughs allowed the design of microarrays containinghigh-density tiled probes (termed tiling arrays) that coverthe entire genome of an organism or contiguous regionsof it, and such tiling arrays provided the first compre-hensive transcriptome map for E. coli20 and several otherbacterial species21–23. However, tiling experiments haveonly recently been able to deliver strict strand-specificity and high-resolution cDNA mapping 21 through the useof actinomycin D in the reverse transcription reactionto inhibit DNA polymerase activity 24 (FIG. 1b). The tiling
arrays also revealed that most transcripts are present inlevels that are just above background noise25. Thus, thissystem has a far from ideal signal-to-noise ratio 26 andcannot offer single-based resolution of transcriptionstart sites (TSSs) (FIG. 1c).
In 2008, RNA-seq was introduced, which involves
deep sequencing of cDNA generated from RNA prepa-rations27,28. This technology has overcome some of thedrawbacks of tiling arrays: it provides single-base reso-lution, a better signal-to-noise ratio owing to a reducedbackground and a higher dynamic range29. RNA-seqallows the detection of various transcriptional features,including the 5′ end of all RNAs30,31, and TSSs canbe detected by selecting for primary RNA transcriptsthat have a 5′-triphosphate (processed RNAs have a5′-monophosphate)32. The current sensitivity of ultra-sequencing makes it possible to study unculturablebacteria or bacteria that cannot be isolated (such as endo-symbionts), and has allowed the transcriptomes of micro-bial communities (metatranscriptomes) to be explored,such as those of communities found in marine samples
(see below)33,34. However, metatranscriptomics technol-ogy is not yet standardized and several issues remain tobe solved, such as the unequal coverage of genes, whichcould result in apparent premature transcriptionaltermination for genes with low expression (FIG. 1c).
The operon, a concept under revision
Genes in bacterial genomes are organized into oper-ons, which are defined as functional genomic units thatcontain multiple genes under the control of a singlepromoter. However, this definition no longer stands in
various aspects (FIG. 2). It was thought that this organiza-tion leads to an equal level of expression for all genes, butuneven gene expression within operons was observedseveral years ago35, and recent genome-wide transcrip-tomics studies have revealed that in many bacterial spe-cies, consecutive genes within operons do not have thesame expression level, leading to operon polarity 21,36 (FIG. 2a). Operon polarity could be explained by the pres-ence of internal transcription terminators, by the activity of sRNAs or by riboswitches within operons that block oractivate transcription in response to metabolite binding(see below)37–39. The combinatorial effect of internal pro-moters and terminators would result in the productionof different transcripts, thereby adding plasticity to theoperon. For instance, mapping transcriptional initiationsites in Helicobacter pylori revealed 337 operons, but the
detection of internal TSSs added 192 additional cistrons40 (FIG. 2b), whereas in Mycoplasma pneumoniae, 341 refer-ence operons could be divided into 447 smaller suboper-ons21 (FIG. 2b). Furthermore, transcription initiation may be coupled to termination in some cases. In the E. coligalactose operon, two promoters are separated by 5 bp.Transcription driven from the first promoter is termi-nated earlier than that from the second, suggesting thattranscription termination could depend on transcrip-tion initiation41, possibly as a result of different assem-blies of the RNA transcriptional machinery as dictatedby the promoter. Transcription elongation factors suchas NusA, NusG or GreA can determine the processivity
Box 1 | Computational resources to process transcriptomics data
Computational approaches have dramatically changed with the evolution of
transcriptomics. In 2004, the first DNA ultrasequencing platforms enabled the
simultaneous reading of several hundred thousand DNA fragments with a read length
greater than 100 bases. Today, they produce over 200 million 75–100‑base reads, and
the large throughput of modern sequencing platforms has resulted in them gradually
replacing the array‑based technologies. RNA‑seq28 is probably one of the most complex
next‑generation applications.The first step after the data have been acquired is to align sequences to a reference
genome, which can be done using a growing number of software packages154. The
choice of alignment algorithm is strongly influenced by both the experiment in
question and the details of the sequencing technology used. These algorithms have to
cope with unprecedented amounts of data and specific errors associated with the
specific platform (for example, a tendency for insertion or deletion errors to occur in
homopolymer reads, or an increase in the likelihood of sequence errors towards the end
of the read). Alignment programs use heuristic techniques to quickly identify a small set
of locations in the reference sequence where the best mapping is most likely to be
found. Within this set of putative mapping locations, slower and more accurate
algorithms are used to map the reads. The first tools used hash‑based alignment
methods: examples include SOAP155, MAQ156 and ELAND (Illumina, unpublished).
A second generation of tools has been developed on the basis of the Burrows–Wheeler
transform (BWT): examples include BOWTIE157, BWA158 and SOAP2 (REF. 159). BWT
implementations are much faster than their hash‑based counterparts at the samesensitivity level.
The second step is to represent and analyse the data. Different tools are available, but
some of the most widely used and powerful are within the Bioconductor project: the
ShortRead package provides functionality for the import, quality assurance, visualization
and basic manipulation (such as pile‑up calculation and read property examination) of
‘short‑read’ DNA sequences. In addition, there are packages that address more specific
aspects, such as identifying differentially expressed genes (DEGseq) or providing
alternative base‑calling algorithms (Rolexa).
R E V I E W S
NATURE REVIEWS | MICROBIOLOGY VOLUME 9 | SEPTEMBER 2011 | 659
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 3/12
Deep sequencingNew sequencing technologies
that make use of massive
parallelization of the
sequencing process. Data
provided by these new
sequencers consist of a large
number of reads (millions) in
each run but with a short read
length (of a few hundred
bases).
Cistrons
Segments of DNA that have
the information to produce a
polypeptide chain.
of the polymerase42. Moreover, other proteins (such asSpx and Dsk), the 6S sRNA or even small molecules(including (p)ppGpp or NTPs) can bind to the initialRNAP complex independently of DNA and change theproperties of the holoenzyme, although binding dependson environmental conditions42. It is tempting to imag-ine that promoters could engage various transcriptioncomplexes that could produce different RNA outputs.Almost half of the polycistronic operons in M. pneumo-
niae show natural polarity and transcriptional attenu-ation (optional use of terminator signals)21 (FIG. 2a).Transcriptome mapping data for M. pneumoniae haverevealed that, within an operon, transcription attenua-tion generally coincides with the stop codon21. Possibly,as a result of transcriptional–translational coupling, theribosome induces transcriptional termination in stopcodons by an as-yet-unknown mechanism. Indeed, theα-subunit of M. pneumoniae RNAP associates withthe ribosomal protein RpsD, suggesting a crosstalkbetween the transcription and translation machineries43.Furthermore, the first ribosome bound to the mRNAcooperates with the RNAP in transcription elongation44.
These data not only support a coupling between tran-scription and translation but also offer an explanationfor transcriptional attenuation based on an increasedprobability of RNAP release if the ribosome dissociateswhen it encounters a stop codon. The complexity of bacterial operons can thus be compared to that of alter-native splicing in eukaryotes. However, the regulatory mechanisms of this alternative transcription remain tobe unveiled. Deep sequencing technologies now allow
TSSs to be identified to a single-base precision, thereforemaking it possible to map all internal promoters withinan operon32. However, lacking still is a technology thatcould identify the termination site with the same preci-sion, although it is expected that long reads provided by single-molecule DNA sequencing45,46 could cover thatgap. This knowledge is essential to understand the rulesgoverning RNAP release and transcriptional termina-tion at internal operon sites. Combining ultrasequenc-ing and chromatin immunoprecipitation followed by sequencing (ChIP–seq) to target all proteins associatedwith RNAP could also offer insight into which transcrip-tional complexes are assembled at each promoter, as well
Figure 1 | Methodological transcriptomics tool kit. a | Various methods that have been recently developed to measure
different transcriptome features by mapping transcripts, antisense transcripts, transcription start sites (TSSs) and
protein-binding sites. b | Strand specificity in RNA-seq studies. Second-strand cDNA synthesis during reverse transcription
reactions can by prevented with actinomycin D (bottom panel).c | Comparison of the advantages of transcriptome
mapping based on either deep sequencing (RNA-seq) or tiling arrays. These are representative loci of the Mycoplasma
pneumoniae chromosome; the gene name is indicated above the dotted line. ChIP–chip, chromatin immunoprecipitation
followed by microarray; RNAP, RNA polymerase.
R E V I E W S
660 | SEPTEMBER 2011 | VOLUME 9 www.nature.com/reviews/micro
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 4/12
′ ′
as the relationship between the assembly of these andthe processivity and recognition of transcriptional ter-mination signals. Native elongating transcript sequenc-ing (NET-seq) is a new method that visualizes RNAsin vivo as they are produced47. This technique utilizesthe high stability of the RNAP−DNA−RNA complexthat allows it to be affinity purified without crosslink-ing, so that bound RNAs can be subsequently submit-ted to deep sequencing. This method could be applied
to bacteria, allowing nascent transcripts to be identifiedand both transcription elongation and RNAP pausingto be monitored.
Regulations at the operon level
Like eukaryotic promoters, bacterial promoters can beregulated by more than one signal or transcription fac-tor48. In Caulobacter crescentus, 6% of the genes are tran-scribed from multiple TSSs22, indicating that various
Figure 2 | Recent discoveries in bacterial transcriptomics. a | Intra-operon decaying expression. Almost half of theconsecutive genes within operons in Mycoplasma pneumoniae show ‘staircase’ behaviour21. In general, the steps tend to
occur in the proximity of the stop codons at the end of every gene. b | Alternative transcripts in operons. Three alternative
transcripts are present in the cag25–cag18 primary operon in Helicobacter pylori40. c | Abundance of antisense RNA. The
number of antisense RNAs in various species as a percentage of the number of open reading frames (ORFs). d | Leaderless
mRNAs. Transcripts starting at the translation initiation codon are preferentially bound by the 70S ribosome, whereas
those with a Shine–Dalgarno sequence are preferentially recognized by the 30S ribosomal subunit.e | mRNA intrinsic
regulation. A lysine riboswitch is located upstream of a lysine transporter gene (lmo0795)60. In the presence of lysine
lmo0795, transcription is prevented by a Rho-independent terminator. Functional elements present on mRNA molecules
are depicted. f | Overlapping untranslated regions (UTRs). The lmo0649 mRNA (red) can be transcribed from two
promoters. Transcription from the distal promoter generates a long UTR that overlaps with the coding region of lmo0648
(blue)60. g | Chromosome structure. Bacterial chromatin exhibits different degrees of compaction and supercoiling
depending on the growth phase (it is more compacted in the exponential phase compared to the stationary phase) and
on the transcription status (not shown). h | Epigenetic modifications. Methylation (Me) states of bases (in this case a
guanosine (G)) that affect transcription can be inherited. Binding of a repressor to an unmethylated promoter prevents
transcription of the repressor protein. Perturbations that lead to promoter methylation prevent the repressor from bindingto the promoter; this methylated state is inherited and a new stable state is created. TSS, transcription start site.
R E V I E W S
NATURE REVIEWS | MICROBIOLOGY VOLUME 9 | SEPTEMBER 2011 | 661
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 5/12
Stationary phase
A stage of bacterial growth in
which the growth rate slows as
a result of nutrient depletion
and accumulation of toxic
products.
activators and/or repressors can regulate transcription.Thus, an operon could respond to two or more differentinputs, enabling higher regulatory plasticity and respon-siveness to stimuli. In some cases, converging signalsregulate the promoter by cooperatively binding to theDNA, thus increasing the robustness of the response49.Cooperative binding can result in either bimodal or all-or-none expression and can turn on gene expression afteran input has crossed the threshold required for activa-tion48; an example of this is the famous phage λ switch inE. coli50. Complex regulatory mechanisms can likewise beobserved when there are bidirectional promoters, whichare also common in eukaryotes; 66% of divergent genepairs in yeast51 and 10% of all promoters in the humangenome show such regulation52. Depending on the overlapbetween the two promoters, they might be co-expressed53,co-expressed with a particular hierarchy 54 or anti-regulatedon each strand. They might also share transcription fac-tor target sites, which would be consistent with the factthat divergent genes tend to have related functions inhigher eukaryotes55. To achieve a global view of such
complex regulatory control in bacteria, all transcriptionfactor binding sites will have to be mapped and all TSSsprecisely determined.
In the case of the gene that encodes Rns, a transcrip-tional regulator from the AraC family, the binding sitefor activators is located downstream of the core pro-moter but, surprisingly, Rns activates its own expres-sion by binding both upstream and downstream of thepromoter56 in a manner that is reminiscent of enhancers in downstream regions in eukaryotes. A recent study onE. coli that examined 600 combinations of promoter andcoding regions from various genes found evidence thatthe regulatory information stored in the coding regionsof genes strongly affects gene expression levels, indicatingthat this could be a more general phenomenon57.
DNA looping, which was discovered in the ara and gal operons and subsequently found in other bacterialregulatory regions58, is used in some bacteria to regulatetranscription. In several E. coli promoters, including thehdeABp promoter, H-NS represses RNAP complexescarrying the σ70 subunit. However, σ38 RNAP com-plexes are not repressed. Atomic force microscopy andother techniques revealed that DNA traps σ70, but not σ38,by completely wrapping it in cooperation with H-NS59.The increase in resolution of experiments that addressthe three-dimensional (3D) structure of chromatin (thatis, chromosome conformation capture (3C) experiments),
in combination with ChIP–seq experiments, will allowa complete analysis of all DNA looping regions andopen the way to a comprehensive study of their role intranscriptional regulation.
sRNAs galore
Although it was known that bacteria express sRNAs of 50–500 bases, the sheer abundance of such sRNAs, andespecially antisense RNAs (some of which are thoughtto have regulatory roles), was unexpected21,40,60,61(FIG. 2c).These RNAs can work in cis by targeting a gene withinthe same locus62 or in trans63,64 by targeting loci elsewherein the genome. sRNAs are involved in regulating various
processes, such as transcriptional interference, transcrip-tional activation, translational control and regulation of mRNA half-life65. sRNAs may be a part of ribonucleo-protein complexes (and thus modulate protein activity,as is the case for 4.5S RNA and transfer mRNA (tmRNA;also known as SsrA)66), sequester proteins (as exempli-fied by 6S RNA, which binds RNAP and induces changesin gene expression at the transition to the stationary
phase67) or directly regulate mRNA translation and/orstability 68. sRNAs could also indirectly regulate tran-scription of neighbouring genes by promoting changesin DNA supercoiling69,70 as a result of transcription of thesRNA gene. For example, in M. pneumoniae, genes withoverlapping antisense transcripts have lower expressionlevels21. Trans-encoded sRNAs typically interact withmultiple mRNAs71, as these sRNAs contact their targetmRNAs in discontinuous patches. Thus, a single RNAcan globally modulate particular physiological responsesand networks in a manner similar to a transcriptionfactor but at the post-transcriptional level72–75 and with
varying degrees of stringency and outcomes76.
The high number of sRNAs indicates that they musthave an important role in bacterial physiology. Unveilingthe functions of these sRNAs in gene regulation shouldbe addressed by high-throughput experiments afteroverexpression or knockout of trans sRNAs or after genesilencing of cis RNAs by complementary antisense RNA.
It has been suggested that the number of antisensetranscripts roughly anticorrelates with the genome size77 based on the finding that in the E. coli genome, whichcontains ~4,000 genes, antisense RNAs represent 2.4%of the total number of genes, whereas in the M. pneumo-niae genome, which is much smaller than the genomesof most other bacteria and contains only 689 protein-coding genes, including only a handful of transcriptionfactors21, ~12% of the genes encode antisense RNAs.Such a disproportionately large antisense RNA popu-lation could be the biological consequence of genomereduction, with transcriptional regulation taken overby RNA-mediated control. However, the use of ultrase-quencing technology in E. coli has led to the percentageof antisense RNAs being revised upward from 2.4% to20% of the genes78. Similarly, in H. pylori, 27% of genesencode for antisense RNAs32, and manual examinationof M. pneumoniae transcripts raised the total to closeto 20%. These data suggest that the proportion of anti-sense RNA could be ~10–20% of the genes in bacteria,and that the small numbers previously observed were
a result of low-sensitivity detection technologies. Thesenumbers are similar to those found in certain eukaryotes(with ~20% in humans79 and plants80) and slightly higherthan in yeast (7%)51. Recent metatranscriptomic stud-ies of complex samples containing thousands of bacte-rial sequences have also corroborated the abundance of sRNAs within bacterial communities; for example, over40,000 new sRNAs were identified in a single study 33.
Aside from the challenge of determining the functionof most of these sRNAs, there are also technical issuesthat remain to be solved. For example, the heterogene-ity that arises from the diverse protocols and samplingmethods used in the different laboratories leads to an
R E V I E W S
662 | SEPTEMBER 2011 | VOLUME 9 www.nature.com/reviews/micro
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 6/12
overall poor reproducibility in sRNA determination;the results of independent transcriptomics studies inBurkholderia cenocepacia did not overlap81, and stud-ies in Salmonella enterica subsp. enterica serovar Typhiand S. enterica subsp. enterica serovar Typhimuriumdetected only partial overlap, with an overlap of 42 outof 82 annotated sRNAs in one data set 82 and of 5 out of 52 in another data set83. Strand specificity needs to beensured when sequencing RNA transcripts, as artefac-tual sRNAs could be found opposite to coding genes24.Thus, complementary high-throughput approaches21,84 or validation by either quantitative PCR or RACE85 arerequired to confirm all newly discovered sRNAs beforestarting functional studies. These technical issues not-withstanding, the use of omics technologies has allowedthe discovery of conserved sRNAs that could not beidentified by genome sequence analyses86, as well asnew mechanisms, such as the binding of the 6S RNA toRNAP to produce short RNA products87.
Deep sequencing in combination with pulldowns of Hfq proteins in bacteria have identified sRNAs as the
major target of these proteins83. Complete catalogues of sRNAs from several species will allow the identificationof conserved sRNAs, which could point to functionality,whereas anticorrelated expression patterns could suggesta mechanism of regulation of mRNA transcription by sRNAs and identify new targets for functional studies.
sRNAs and small peptides
Some sRNAs are multifunctional as they encode func-tional peptides. Probably the most paradigmatic exam-ple of this is RNAIII in Staphylococcus aureus, which wasone of the first cis-acting RNAs to be described and isinvolved in quorum sensing. RNAIII encodes a 26-amino-acid peptide that may be involved in biofilm integrity 88.Other cases have been documented in E. coli 89,90 andin Listeria monocytogenes60. For example, SgrS causestranslational repression when pre-annealed with ptsGin E. coli91. Interestingly, the 5′ region of SgrS contains a43-amino-acid open reading frame (ORF) termed sgrT ,which is translated during glucose-phosphate stress.Downregulation of ptsG mRNA does not require SgrT,and SgrT by itself has no effect on ptsG mRNA stability.However, cells expressing SgrT alone have a defectiveglucose uptake, even when they exhibit nearly wild-typelevels of PtsG. Together, these data suggest that SgrS isa bifunctional sRNA that encodes physiologically redun-dant but mechanistically distinct functions that contrib-
ute to the same stress response90. This list is expected toexpand substantially as small peptides are difficultto detect using standard proteomics approaches andrequire appropriate isolation and fractionation techniquesthat specifically enrich small proteins92. The combinationof deep sequencing, translation in the three phases of theidentified sRNAs and the guided search for all ORFs witha translation initiation codon using mass spectrometry could reveal a new layer of regulation in bacteria.
RNA processing
It is generally assumed that post-transcriptional regula-tion of RNA in eukaryotes is extremely important and
complex, whereas it is either minimal or non-existent inbacteria. Thus, nascent mRNA in bacteria is thought tobe bound to ribosomes and degraded after some min-utes. However, recent studies have shown the existenceof mechanisms that affect mRNA stability in bacteria(FIG. 3a), as well as different types of RNA processing.RNA degradation is usually triggered by an endonucleo-lytic cleavage, followed by a rapid exonucleolytic activity.Different ribonucleases (RNases) are responsible for thefirst step, depending on the species, and some proteinscan even cleave their own mRNA and thereby autoregu-late their expression93. Several factors affect the stability of a given mRNA, including RNA-binding proteins andsRNA94. For example, removal of the 5′-triphosphate by the enzyme RppH triggers mRNA degradation in E. coli95.Furthermore, RNA stability is species dependent and can
vary depending on the environmental conditions96.There are several examples of RNA processing in
which internal cleavage occurs in a regulated manner 97.It is expected that differential identification of primary RNAs with a 5′-triphosphate and processed RNAs with
a 5′-monophosphate32 will help to identify in vivo mRNAprocessing sites at the genome level and under differentconditions. Contrary to the common view, polyadenyla-tion (addition of untemplated adenosine residues tothe 3′ end of transcripts) is also present in bacteria andcellular organelles of bacterial origin (TABLE 1). In factbacteria perform two types of polyadenylation: classicpoly adenylation catalysed by poly(A) polymerase I, whichadds exclusively adenosines, and addition of a mixture of different bases with about 50% adenosines98 catalysed by polynucleotide phosphorylase (PNPase). Genome-widestudies using poly(T) oligonucleotides to enrich thesetranscripts indicated that around 0.01–0.2% of the tran-scripts have poly(A) tails, but this low number could berelated to the fact that they are degraded very rapidly 99;in B. subtilis, around 20% of the total RNA was found tobe polyadenylated100 and a recent genome-wide study of polyadenylated RNA in E. coli revealed that around 72%of all RNAs that contain a Rho-independent terminationsignal are polyadenylated101. Thus, polyadenylation is acommon modification and, moreover, can vary with thegrowth phases102. Several possible roles for polyadenyla-tion in bacteria have been suggested; it was reported thatpolyadenylation targets transcripts for degradation by thedegradosome103 and that polyadenylation is involved inquality control for transcriptional and processing errors104.
Furthermore, RNA editing can alter the genetic
information by mutation, deletion or insertion of a base.This process has been well documented for eukaryoticendosymbionts, in which RNA editing is promotedby guide RNA, which is another type of sRNA. In themitochondrion of Trypanosoma brucei, for instance,many mRNAs cannot be translated owing to multipleframeshifts; in this case, guide RNAs serve as templatesfor correcting such mistakes at the mRNA level with theaid of ribonucleoprotein complexes105,106.
Finally, RNA splicing has been reported in endo-symbionts107 and bacteriophages108, indicating that thisfeature is also not exclusive to eukaryotes (TABLE 1). Thesefindings highlight the ancient role of RNA and ribozymes
R E V I E W S
NATURE REVIEWS | MICROBIOLOGY VOLUME 9 | SEPTEMBER 2011 | 663
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 7/12
′ ′
′
′ ′
and provide insight into putative functions of some of the still uncharacterized sRNAs. Group II self-splicingintrons are large catalytic molecules that are consideredto be related to the progenitor of spliceosome introns.
They were initially identified in the organelles of lowereukaryotes and plants and are also present in bacteria,although their frequency is unknown109. Genome-wideRNA-seq analysis could be used to obtain a global viewof all of these post-transcriptional processes, althoughthis would involve a modification of existing standardmapping methods to allow for the identification of added poly-base tails that are not present in the genomeor of the fusion of non-contiguous DNA sequences.
Leaderless mRNAs and regulation by UTRs
The Shine–Dalgarno ribosomal recognition sequencewas long considered to be essential for ribosome
attachment and efficient translation of mRNA. The lat-est transcriptomics studies have now demonstrated thatseveral RNA transcripts are leaderless: that is, that thesetranscripts start with one start codon (FIG. 2d) or have
non-canonical ribosome-binding sites110. The mecha-nism by which leaderless transcripts with no or a very short 5′ untranslated region (UTR) could be recog-nized by the ribosome has long been unclear. However,in E. coli, it was shown that the ATG initiation codoncan become the ribosome-binding sequence111 and thatleaderless transcripts can bind 70S ribosomes rather than30S ribosomal subunits, suggesting a different pathway of translation for these mRNA112. Genome-wide tran-scriptional studies have revealed that there are 26 leader-less RNAs in H. pylori 40 and 25 in S. Typhimurium82.These transcripts may be preferentially translatedwhen bulk mRNAs cannot be translated, inducing a
Figure 3 | Perspectives in transcriptomics. a | RNA modifications. There is growing evidence that RNA in bacteria can
undergo a number of post-transcriptional modifications that alter its stability and functionality. New ultrasequencing
methods coupled with classical biochemical approaches should allow comprehensive studies on these modifications and
reveal their function. b | Spatial organization of transcription and translation. The colocalization in space of functionally
related genes could drive the assembly of newly synthesized proteins into complexes143. The implementation of new
methodologies, such as chromosome conformation capture (3C) in bacteria129, will make it possible to obtain a
three-dimensional view of bacterial chromatin. c | Full-length transcripts. New sequencing technologies could provideoperon-long reads. The assembly-free sequencing of transcripts will allow the characterization of alternative transcript
termination and provide the chance to study the relationship between transcription initiation and termination. d | RNA–
protein interactions. The identification of RNA-binding proteins will be another means of gaining insight into the roles of
newly discovered RNAs. e | RNA secondary structure. High-throughput RNA structure determination should increase our
understanding of how non-coding RNAs, such as untranslated regions and small RNAs (sRNAs), function.f | Single-cell
transcriptomes. A substantial percentage of the transcripts are in the stochastic range. Studies in individual bacteria will
provide new biological insights into population heterogeneity and regulation. RNAP, RNA polymerase.
R E V I E W S
664 | SEPTEMBER 2011 | VOLUME 9 www.nature.com/reviews/micro
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 8/12
non-transcriptional shift between proteins producedin non-optimal conditions, such as starvation or station-ary phase, when 70S monosomes prevail113. Consistentwith this, in E. coli, leaderless mRNAs are poorly trans-lated during the exponential phase and are only translatedwhen genes containing a canonical ribosome binding siteare not translated114; this exemplifies another regulatory mechanism at the level of translation.
Transcriptome studies have also revealed that many mRNAs contain a 5′ UTR of up to several hundred bases(the average length in M. pneumoniae is ~60 nucleo-tides)21. These regions may be regulating translationthrough secondary structure. Some of these long UTRsencode proteins or peptides; a classic example is theE. coli trp operon leader peptide, which controls pre-mature transcriptional termination depending ontryptophan availability 115. Alternatively, they can alsocontain riboswitches, which are regulatory regions thataffect mRNA stability and translation efficiency 40,60.Riboswitches are regulated by binding small moleculesthat act as environmental signals. Several classes of riboswitches that could regulate translation in responseto metabolites have been described17 (FIG. 2e). For exam-ple, the S-adenosylmethionine (SAM)-III riboswitch isa SAM-binding element that is found in the 5′ UTR of the metK gene, which encodes the SAM synthetase, in
Lactobacillus spp.116. SAM binding to SAM-III resultsin sequestration of part of the ribosome binding sitesequence of the metK mRNA, thus inhibiting ribo-some binding and leading to repression of translationinitiation116,117. Recent experiments have revealed thatSAM-III is a reversible riboswitch that can differentially regulate the transcript at various stages of the lifetime of the transcript, thus allowing the cell to respond rapidly to SAM fluctuations by regulating metK expression118.
There are other roles for UTRs. For instance, mogRmRNA in L. monocytogenes can be transcribed from twodifferent promoters. Transcription from the first pro-moter produces a long isoform that is induced only in the
stationary phase and generates a long 5′ UTR that over-laps with three genes. This 5′ UTR binds to the mRNAof the overlapping genes, leading to their degradation60 (FIG. 2f).
The abundance of leaderless mRNAs, as well asthe presence of mRNAs with long 5′ UTRs, suggests theexistence of a hidden and complex layer of regulationby the UTRs. Determination of the 5′ end of all mRNAscould elucidate different mechanisms of translationalregulation and guide further studies. As determiningthe structure of RNA is key to understanding its func-tion, another promising technique is the use of RNasesto determine the specific secondary structures of RNAs,which can be performed genome-wide. This allows base-paired regions within RNAs to be identified, revealingpotential riboswitches and other relevant structures.This method has been successfully applied to yeast119 and mouse transcriptomes120 and is valuable for bothUTRs and sRNAs.
Bacterial DNA is more than a random bundle
Like eukaryotes, bacteria contain nucleoid proteins thatbind DNA in a sequence-specific or non-sequence-specific manner, thereby promoting nucleoid compac-tion and domain formation and varying gene expressionlevels in response to environmental perturbations108.
E. coli contains around 450 nucleoid structure domains,which have an average length of around 10 kbp, with
variable boundaries and a random distribution along thechromosome121,122. Nucleoid compaction and domainformation of DNA in bacteria are expected to have aneffect on transcriptional regulation14,123. Because nucleoidstructure can be altered by changes in DNA supercoil-ing or changes in expression of histone-like proteins,it has been proposed that nucleoid reorganization anddynamics in E. coli could assume the role of a ‘transcrip-tion factor’ (REF. 124). For example, changes in DNAsupercoiling in E. coli induced by osmotic stress resultin major changes in gene expression118.
Table 1 | Bacterial transcriptome features that recall eukaryotic complexity
Feature Present in eukaryotes? Present in bacteria?
Small RNAs Yes Yes
Complex promoter regulation Yes, eukaryotes have many transcription factorsand other proteins
Yes, but simpler than in eukaryotes
Alternative transcripts Yes Yes, alternative promoters and terminators arepresent
RNA processing Yes Yes, but mainly related to the regulation of degradation, with few examples of specific processing
RNA splicing Yes Yes, but mainly restricted to organelles (mitochondriaand chloroplasts), tRNAs and ribosomal RNAs, andthere are few examples
Polyadenylation Yes, polyadenylation stabilizes RNA Yes, polyadenylation destabilizes RNA
Localized translation Yes, many mRNAs are transported to specific siteswhere they are translated
Yes, recent data show that mRNAs in bacteria donot diffuse and are translated on specific nucleoidlocalizations
Epigenetic modifications Yes Yes, but little information available
Impact of chromatin and nucleoidstructure on transcriptional regulation
Yes, supercoiling and chromatin domains have amajor impact on transcription
Yes, several examples have been found
R E V I E W S
NATURE REVIEWS | MICROBIOLOGY VOLUME 9 | SEPTEMBER 2011 | 665
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 9/12
Occupancy studies in E. coli have shown the presenceof regions in the chromosome with contiguous proteinbinding of around 1.6 kbp in length, some of which cor-respond to active transcriptional regions, such as theribosomal RNA operon125. Interestingly, most E. coli orSalmonella spp. genes are expressed at a low level — onaverage, the mRNA/DNA ratio is less than one103 — andwould thus have little or no impact on nucleoid structure(FIG. 2g). In agreement with this observation, only very few loci have a high concentration of RNAP, as observedby electron microscopy 126. When cells are in the station-ary phase, the transcription factories disassemble, RNAPis distributed throughout the chromosome and the 3Dstructure becomes less compact (FIG. 2g). However,other regions with contiguous protein occupancy correspond to transcriptionally silent areas and could beinvolved in establishing chromatin domains125. Locally,chromatin domains induced by RNAP supercoiling canbe consolidated if the encoded protein is a membraneprotein127,128. This is owing to the fact that insertion of a protein into the membrane, or periplasm, during co-
transcriptional translation will tether its mRNA to thecell membrane, and this will enhance topological effectson the DNA127,128. Once established, chromatin domainscan induce a type of topological memory generating lin-eages with different expression states, thereby creatingcellular subpopulations that have the same genome butare primed to respond to various environments128.
High-throughput techniques have been applied to thedetermination of long-range DNA interactions and 3DDNA structure in vivo in eukaryotes: namely 3C129 and thelater variations circularized chromosome conformationcapture (4C), carbon-copy chromosome conforma-tion capture (5C)130 and Hi-C131. These techniques rely on crosslinking, restriction digestion and re-ligation of DNA regions that are close in space but not in the pri-mary sequence132. They have been applied to regions of the human genome133,134 and in a genome-wide study inyeast135. Similar studies in bacteria would help to unveilthe structure and dynamics of the bacterial chromosome,as well as its role in transcription regulation.
Epigenetics: memory of the bacterial nucleoid
Until recently, methylation of bacterial DNA was con-sidered to be used primarily to distinguish foreign DNA.However, DNA methylation has important roles in bac-teria, such as during chromosome partitioning, DNAreplication, DNA repair, timing of transposition and
conjugation136. In bacteria, cytosine can be methylated atN4 (m4C) or N5 (m5C) and adenine can be methylatedat N6 (m6A). It has been suggested that m4C is relatedto epigenetic transcriptional regulation136, and severalstudies have revealed that individual bacterial cells canhave a particular DNA methylation status that changestranscription of some operons in a specific, heritablemanner137,138. This occurs through a competition formethylation sites between a methylase and the proteinthat recognizes the unmethylated state (FIG. 2h); bindingof the protein that recognizes the unmethylated state willprevent access of the cognate methylase to the site, whichaffects transcription of the downstream genes. This effect
can be enhanced if the gene downstream encodes for theprotein that binds the unmethylated site and if its tran-scription requires a non-methylated status. If changesin environmental conditions result in degradation orinhibition of the protein protecting the methylation site,then the site will be methylated, blocking further tran-scription; this status will be inherited by the subsequentgenerations. A typical example is the pap pilin phase
variation125. The pap operon has two leucine-responsiveprotein (Lrp) binding sites. Binding of Lrp to the proxi-mal site blocks transcription and permits methylation of the distal site, which prevents Lrp from binding there. If poly(A) polymerase I is expressed, it dimerizes with Lrpand binds at the distal site, allowing methylation of theproximal site and therefore transcription of the operon.It is estimated that the methylation status of only a smallfraction of the approximately 20,000 methylated sites inthe E. coli chromosome can change when subjected todifferent conditions (for a review, see REF. 136).
Although DNA methylation is the best-known mech-anism involved in epigenetic regulation in bacteria, for-
mation of a possible feedback loop by other mechanismsalso permits inheritance of epigenetic states139. The latestultrasequencing technologies that sequence single DNAmolecules in real time have made it possible to deter-mine which bases are methylated, as these bases inducea delay in the polymerization process140. This opens thepossibility of complete genome-wide analyses of DNAmethylation at adenine and cytosine, which could bedetermined under various environmental perturbationsand thus provide insights into the role of epigenetics intranscriptional regulation140.
mRNA localization in bacteria
Organisms from all three domains of life can local-ize proteins to specific regions within a cell, either by targeting the protein with a signal that directs it to itsdestination or by locating the mRNA that encodes theprotein to the appropriate region. Both mechanismsoperate in eukaryotes, whereas only the first mecha-nism has been reported in bacteria141. Originally, track-ing single RNA molecules in E. coli showed localizedmotion consistent with Brownian motion of a polymerbound to the DNA and free diffusion142, but recent stud-ies are challenging the notion that mRNA in bacteria israndomly distributed. Analyses of mRNA localizationand diffusion in C. crescentus and E. coli have shownthat mRNA diffuses two orders of magnitude less than
would be expected in solution and that it restricts ribo-somal mobility 143. As a consequence, translation couldbe spatially organized by using the chromosome layoutas a template (FIG. 3b) and proteins encoded by neigh-bouring genes could be produced in close proximity.Therefore, spatial co-expression could promote imme-diate protein interactions and macromolecular complexformation143. If this proves to be a general phenomenon,it would justify the existence of operons encoding pro-tein complexes, as well as of chromosomal 3D domainsharbouring genes involved in related functions. Such3D clusters that include genes that are co-regulatedby a given transcription factor have been described in
R E V I E W S
666 | SEPTEMBER 2011 | VOLUME 9 www.nature.com/reviews/micro
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 10/12
mammals, in which the term ‘transcriptional factories’has been proposed144.
Thus, the influence of chromatin structure on geneexpression and on the organization of RNA synthesis inspace is not an exclusive feature of eukaryotes. Recently,it was found that certain mRNAs in E. coli can migrateto the destination of their encoded proteins. This processis controlled by sequences within the RNA transmem-brane-coding sequence. Although the molecular mecha-nism is unknown, the data suggest an active transportof the mRNA to its translation location, as occurs ineukaryotes145. Other studies have shown that complexesinvolved in mRNA degradation show specific localization(for a recent review, see REF. 146).
If applied to all RNAs in a bacterium, technologiesthat allow the visualization of single RNA molecules inthe cell147,148 could elucidate whether mRNA localiza-tion is a widespread phenomenon. Determining dis-tinct RNA localizations could allow the identification of specific sequences that guide this localization, thereby opening a new research area.
Perspectives
Future directions should involve integrating our knowl-edge of transcription with the physiology of the micro-organism. Therefore, data should be complementedwith other levels of regulation. Technology based onsingle-molecule sequencing46 could represent the nextimportant advance in transcriptomics, as it should becapable of delivering reads with lengths up to tens of thousands of bases, allowing full-length polycistronictranscripts to be sequenced and providing directassociation of promoters and terminators (FIG. 3c).Currently, transcriptomes are constructed by usingshort-read libraries in combination with TSS detectionmethods30,40; this approach can provide information oninternal promoters but not on terminators, and neithercan it provide a complete list of the alternative tran-scripts that compose an operon. Genome-wide map-pings with long reads will not only identify the different5′ UTRs and 3′ UTRs but will also deconvolve the exactmolecular nature of the operon by sequencing full-length transcripts. Additionally, technologies based onsingle-molecule sequencing will substantially enhancethe speed at which data are acquired. Current Illuminaand Roche technologies measure the incorporatedbases by stopping the sequencing reaction. By contrast,new single-cell sequencing technologies can monitor
base incorporation in real time, thereby decreasingsequencing run length from days to hours46.
The number of known sRNAs is increasing every day,but the functions of many of these remain unclear. Newhigh-throughput methods for detecting RNA–proteininteractions are expected to reveal important biological
insights into RNA function (FIG. 3d) and the regulationof protein activity by RNAs. Despite the important rolethat the 3D structure of RNA can have in its function,relatively few RNA structures have been determinedexperimentally. For instance, in several trans-actingsRNAs, only a part of the sequence is involved in basepairing with the target. Based on a 3D structure, it is pos-sible to hypothesize which residues are available to basepair. Parallel analysis of RNA structure (PARS), a newmethod for studying RNA secondary structure genome-wide, obtains information about the secondary structureof RNA by treating RNA with structure-specific enzymesand then subjecting the fragments to deep sequenc-ing. PARS was initially developed for mRNAs, but themethod could also be used on bacterial sRNA fractionsand even entire transcriptomes119 (FIG. 3e).
Transcript levels for the majority of the RNAs aregenerally low (the abundance of mRNA for genes withmore than 100 protein copies per cell ranges from 0.05to 5 mRNAs per cell149). This, together with the fact thattranscription is a stochastic process (as it deals with
very few sites in a large chromosome, and transcriptionfactors are bound nonspecifically to DNA and diffusealong DNA for most of the time150) results in large tem-poral variations among cells149 (FIG. 3f). Gene expres-sion heterogeneity within individual cells is essentialfor various processes151. The single-cell transcriptomicsdata that are needed to understand this stochasticity ineukaryotes are already available152, so a major challengeis to provide similar single-cell data sets for bacteria.
In addition to these future technical prospects,integration of data that combine different sourcesof information is needed to provide a global picture of transcription.
As mentioned above, chromatin organization and epi-genetics in bacteria could play a major part in transcrip-tion and translation. Determining the role of genomicarchitecture and dynamics, together with understand-ing the function of the massive amount of sRNAs,presents the next important challenge in decipheringthe mechanisms of regulation of gene expression at thetranscriptional and translational levels. Therefore,the evidence that non-canonical DNA-binding proteinscould have a role in shaping the nucleoid architecturethat links metabolism and transcription is highly relevant.In this scenario153, obtaining a global overview of DNA-binding proteins, and disentangling their effects on tran-scription and metabolism, would greatly enhance the
design of the multilayer models required to understandthe complexity of a biological system.
Note added in proof
Recently, a paper describing single-cell RNA analysis inbacteria was published160.
1. Jacob, F. & Monod, J. [Genes of structure and genes of
regulation in the biosynthesis of proteins]. C. R. Hebd
Seances Acad. Sci. 249, 1282–1284 (1959).
2. Jacob, F. & Monod, J. Genetic regulatory mechanisms
in the synthesis of proteins. J. Mol. Biol. 3, 318–356
(1961).
3. Wosten, M. M. Eubacterial sigma-factors. FEMS
Microbiol. Rev. 22, 127–150 (1998).
4. Paget, M. S. & Helmann, J. D. The sigma70 family of
sigma factors. Genome Biol. 4, 203 (2003).
5. Perez-Rueda, E. & Collado-Vides, J. The repertoire of
DNA-binding transcriptional regulators in Escherichia
coli K-12. Nucleic Acids Res. 28, 1838–1847 (2000).
6. Santangelo, T. J. & Artsimovitch, I. Termination and
antitermination: RNA polymerase runs a stop sign
Nature Rev. Microbiol. 9, 319–329 (2011).
7. Shine, J. & Dalgarno, L. Determinant of cistron
specificity in bacterial ribosomes. Nature 254, 34–38
(1975).
8. Sigmund, C. D. & Morgan, E. A. Nus A protein
affects transcriptional pausing and termination
in vitro by binding to different sites on the
transcription complex. Biochemistry 27, 5622–5627
(1988).
R E V I E W S
NATURE REVIEWS | MICROBIOLOGY VOLUME 9 | SEPTEMBER 2011 | 667
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 11/12
9. Ma, J. C., Newman, A. J. & Hayward, R. S. Internal
promoters of the rpoBC operon of Escherichia coli .Mol. Gen. Genet. 184, 548–550 (1981).
10. Andre, G. et al. S-box and T-box riboswitches and
antisense RNA control a sulfur metabolic operon of
Clostridium acetobutylicum. Nucleic Acids Res. 36,
5955–5969 (2008).
11. Winkler, W., Nahvi, A. & Breaker, R. R. Thiamine
derivatives bind messenger RNAs directly to regulate
bacterial gene expression. Nature 419, 952–956
(2002).
12. Boni, I. V., Artamonova, V. S., Tzareva, N. V. & Dreyfus, M. Non-canonical mechanism for translational
control in bacteria: synthesis of ribosomal protein S1.
EMBO J. 20, 4222–4232 (2001).
13. Laursen, B. S., Sorensen, H. P., Mortensen, K. K. &
Sperling-Petersen, H. U. Initiation of protein synthesis
in bacteria. Microbiol. Mol. Biol. Rev. 69, 101–123
(2005).
14. Sorek, R. & Cossart, P. Prokaryotic transcriptomics: a
new view on regulation, physiology and pathogenicity.
Nature Rev. Genet. 11, 9–16 (2010).
A complete compilation of advances in bacterialtranscriptomics.
15. Serganov, A. & Patel, D. J. Ribozymes, riboswitches
and beyond: regulation of gene expression without
proteins. Nature Rev. Genet. 8, 776–790 (2007).
16. Makita, Y., Nakao, M., Ogasawara, N. & Nakai, K.
DBTBS: database of transcriptional regulation in
Bacillus subtilis and its contribution to comparative
genomics. Nucleic Acids Res. 32, D75–D77 (2004).
17. de Hoon, M. J., Makita, Y., Nakai, K. & Miyano, S.
Prediction of transcriptional terminators in Bacillus
subtilis and related species. PLoS Comput. Biol. 1,
e25 (2005).
18. Vogel, J. et al. RNomics in Escherichia coli detects new
sRNA species and indicates parallel transcriptional
output in bacteria. Nucleic Acids Res. 31, 6435–6443
(2003).
19. Schena, M., Shalon, D., Davis, R. W. & Brown, P. O.
Quantitative monitoring of gene expression patterns
with a complementary DNA microarray. Science 270,
467–470 (1995).
20. Selinger, D. W. et al. RNA expression analysis using a
30 base pair resolution Escherichia coli genome array.
Nature Biotech. 18, 1262–1268 (2000).
21. Guell, M. et al. Transcriptome complexity in a genome-
reduced bacterium. Science 326, 1268–1271 (2009).
22. McGrath, P. T. et al. High-throughput identification of
transcription start sites, conserved promoter motifs
and predicted regulons. Nature Biotech. 25, 584–592
(2007).
23. Rasmussen, S., Nielsen, H. B. & Jarmer, H. Thetranscriptionally active regions in the genome of
Bacillus subtilis. Mol. Microbiol. 73, 1043–1057
(2009).
24. Perocchi, F., Xu, Z., Clauder-Munster, S. & Steinmetz,
L. M. Antisense artifacts in transcriptome microarray
experiments are resolved by actinomycin D. Nucleic
Acids Res. 35, e128 (2007).
25. Johnson, J. M., Edwards, S., Shoemaker, D. & Schadt,
E. E. Dark matter in the genome: evidence of
widespread transcription detected by microarray tiling
experiments. Trends Genet. 21, 93–102 (2005).26. Royce, T. E. et al. Issues in the analysis of
oligonucleotide tiling microarrays for transcript
mapping. Trends Genet. 21, 466–475 (2005).
27. Wilhelm, B. T. et al. Dynamic repertoire of a eukaryotic
transcriptome surveyed at single-nucleotide
resolution. Nature 453, 1239–1243 (2008).
28. Wang, Z., Gerstein, M. & Snyder, M. RNA-Seq: a
revolutionary tool for transcriptomics. Nature Rev.
Genet. 10, 57–63 (2009). A good introduction to RNA-seq.
29. Vivancos, A. P., Guell, M., Dohm, J. C. , Serrano, L. &
Himmelbauer, H. Strand specific deep sequencing of
the transcriptome. Genome Res. 19, 255–265
(2009).30. Valen, E. et al. Genome-wide detection and analysis of
hippocampus core promoters using DeepCAGE.
Genome Res. 19, 255–265 (2009).
31. Wurtzel, O. et al. A single-base resolution map of an
archaeal transcriptome. Genome Res. 20, 133–141
(2010).
32. Sharma, A. et al. Helicobacter pylori single-stranded
DNA binding protein — functional characterization
and modulation of H. pylori DnaB helicase activity.
FEBS J. 276, 519–531 (2009).33. Frias-Lopez, J. et al. Microbial community gene
expression in ocean surface waters. Proc. Natl Acad.
Sci. USA 105, 3805–3810 (2008).
34. Gloor, G. B. et al. Microbiome profiling by illumina
sequencing of combinatorial sequence-tagged PCR
products. PLoS ONE 5, e15406 (2010).
35. Murakawa, G. J., Kwan, C., Yamashita, J. & Nierlich,
D. P. Transcription and decay of the lac messenger:
role of an intergenic terminator. J. Bacteriol. 173,
28–36 (1991).
36. Laing, E., Mersinias, V., Smith, C. P. & Hubbard, S. J.
Analysis of gene expression in operons of Streptomyces coelicolor . Genome Biol. 7, R46 (2006).
37. Petersohn, A., Antelmann, H., Gerth, U. & Hecker, M.
Identification and transcriptional analysis of newmembers of the sigmaB regulon in Bacillus subtilis.
Microbiology145, 869–880 (1999).
38. Perkins, J. B., Bower, S., Howitt, C. L., Yocum, R. R. &
Pero, J. Identification and characterization of
transcripts from the biotin biosynthetic operon of
Bacillus subtilis. J. Bacteriol. 178, 6361–6365 (1996).
39. McDaniel, B. A., Grundy, F. J., Artsimovitch, I. &
Henkin, T. M. Transcription termination control
of the S box system: direct measurement of
S -adenosylmethionine by the leader RNA. Proc. Natl
Acad. Sci. USA 100, 3083–3088 (2003).
40. Sharma, C. M. et al. The primary transcriptome of the
major human pathogen Helicobacter pylori . Nature
464, 250–255 (2010).41. Lee, H. J., Jeon, H. J., Ji, S. C., Yun, S. H. & Lim, H. M.
Establishment of an mRNA gradient depends on the
promoter: an investigation of polarity in gene
expression. J. Mol. Biol. 378, 318–327 (2008).
42. Haugen, S. P., Ross, W. & Gourse, R. L. Advances in
bacterial promoter recognition and its control by
factors that do not bind DNA. Nature Rev. Microbiol.
6, 507–519 (2008).
43. Kuhner, S. et al. Proteome organization in a genome-
reduced bacterium. Science 326, 1235–1240 (2009).
44. Proshkin, S., Rahmouni, A. R., Mironov, A. &
Nudler, E. Cooperation between translating ribosomes
and RNA polymerase in transcription elongation.
Science 328, 504–508 (2010).45. Clarke, J. et al. Continuous base identification for
single-molecule nanopore DNA sequencing. Nature
Nanotechnol. 4, 265–270 (2009).
46. Eid, J. et al. Real-time DNA sequencing from single
polymerase molecules. Science 323, 133–138 (2009).
47. Churchman, L. S. & Weissman, J. S. Nascent transcript
sequencing visualizes transcription at nucleotide
resolution. Nature 469, 368–373 (2011).
48. Barnard, A., Wolfe, A. & Busby, S. Regulation at
complex bacterial promoters: how bacteria use different
promoter organizations to produce different regulatory
outcomes. Curr. Opin. Microbiol. 7, 102–108 (2004).
49. Kovacikova, G. & Skorupski, K. Overlapping bindingsites for the virulence gene regulators AphA, AphB
and cAMP-CRP at the Vibrio cholera tcpPH promoter.
Mol. Microbiol. 41, 393–407 (2001).
50. Ptashne, M. Regulation of transcription: from lambda
to eukaryotes. Trends Biochem. Sci. 30, 275–279
(2005).
51. Xu, Z. et al. Bidirectional promoters generate
pervasive transcription in yeast. Nature 457,
1033–1037 (2009).
52. Trinklein, N. D. et al. An abundance of bidirectional
promoters in the human genome. Genome Res. 14,
62–66 (2004).
53. Escolar, L., Pérez-Martin, J. & de Lorenzo, V.
Coordinated repression in vitro of the divergent fepA-
fes promoters of Escherichia coli by the iron uptake
regulation (Fur) protein. J. Bacteriol. 180,
2579–2582 (1998).
54. Nieuwkoop, A. J., Baldauf, S. A., Hudspeth, M. E. &
Bender, R. A. Bidirectional promoter in the hut (P )
region of the histidine utilization (hut ) operons fromKlebsiella aerogenes. J. Bacteriol. 170, 2240–2246
(1988).
55. Li, Y. Y. et al. Systematic analysis of head-to-head gene
organization: evolutionary conservation and potential
biological relevance. PLoS Comput. Biol. 2, e74 (2006).
56. Munson, G. P. & Scott, J. R. Rns, a virulence regulator
within the AraC family, requires binding sites upstream
and downstream of its own promoter to function as an
activator. Mol. Microbiol. 36, 1391–1402 (2000).
57. Isalan, M. et al. Evolvability and hierarchy in rewired
bacterial gene networks. Nature 452, 840–845
(2008).
58. Matthews, K. S. DNA looping. Microbiol. Rev. 56,
123–136 (1992).
59. Shin, M. et al. DNA looping-mediated repression by
histone-like protein H-NS: specific requirement of
Esigma70 as a cofactor for looping. Genes Dev. 19,
2388–2398 (2005).
60. Toledo-Arana, A. et al. The Listeria transcriptional
landscape from saprophytism to virulence. Nature
459, 950–956 (2009).
Complete article describing a major pathogentranscriptome.
61. Hershberg, R., Altuvia, S. & Margalit, H. A survey of
small RNA-encoding genes in Escherichia coli . Nucleic
Acids Res. 31, 1813–1820 (2003).
62. Brantl, S. Regulatory mechanisms employed by cis-
encoded antisense RNAs. Curr. Opin. Microbiol. 10,
102–109 (2007).
63. Delihas, N. Regulation of gene expression by trans-encoded antisense RNAs. Mol. Microbiol. 15, 411–414
(1995).
64. Storz, G., Opdyke, J. A. & Zhang, A. Controlling mRNA
stability and translation with small, noncoding RNAs.Curr. Opin. Microbiol. 7, 140–144 (2004).
65. Gottesman, S. & Storz, G. Bacterial small RNA
regulators: versatile roles and rapidly evolving
variations. Cold Spring Harb. Perspect. Biol. 27 Oct
2010 (doi: 10.1101/cshperspect.a003798).
66. Hayes, C. S. & Keiler, K. C. Beyond ribosome rescue:
tmRNA and co-translational processes. FEBS Lett.
584, 413–419 (2010).
67. Wassarman, K. M. 6S RNA: a small RNA regulator of
transcription. Curr. Opin. Microbiol. 10, 164–168
(2007).
68. Waters, L. S. & Storz, G. Regulatory RNAs in bacteria.
Cell 136, 615–628 (2009).
69. Cook, D. N., Ma, D., Pon, N. G. & Hearst, J. E.
Dynamics of DNA supercoiling by transcription in
Escherichia coli . Proc. Natl Acad. Sci. USA 89,
10603–10607 (1992).70. Blot, N., Mavathur, R., Geertz, M., Travers, A. &
Muskhelishvili, G. Homeostatic regulation of
supercoiling sensitivity coordinates transcription of the
bacterial genome. EMBO Rep. 7, 710–715 (2006).
71. Gottesman, S. Micros for microbes: non-coding
regulatory RNAs in bacteria. Trends Genet. 21,
399–404 (2005). A good review of the roles of sRNAs in bacteria.
72. Bejerano-Sagie, M. & Xavier, K. B. The role of small
RNAs in quorum sensing. Curr. Opin. Microbiol. 10,
189–198 (2007).
73. Masse, E., Salvail, H., Desnoyers, G. & Arguin, M.
Small RNAs controlling iron metabolism. Curr. Opin.
Microbiol. 10, 140–145 (2007).74. Papenfort, K., Bouvier, M., Mika, F., Sharma, C. M. &
Vogel, J. Evidence for an autonomous 5′ target
recognition domain in an Hfq-associated small RNA.
Proc. Natl Acad. Sci. USA 107, 20435–20440 (2010).
75. Balbontin, R., Fiorini, F., Figueroa-Bossi, N.,
Casadesus, J. & Bossi, L. Recognition of heptamericseed sequence underlies multi-target regulation by
RybB small RNA in Salmonella enterica. Mol.
Microbiol. 78, 380–394 (2010).
76. Mitarai, N., Andersson, A. M., Krishna, S., Semsey, S.
& Sneppen, K. Efficient degradation and expression
prioritization with small RNAs. Phys. Biol. 4, 164–171
(2007).
77. Qiu, Y. et al. Structural and operational complexity of
the Geobacter sulfurreducens genome. Genome Res.
20, 1304–1311 (2010).
78. Dornenburg, J. E., Devita, A. M., Palumbo, M. J. &
Wade, J. T. Widespread antisense transcription in
Escherichia coli . MBio 1, e00024-10 (2010).
79. Ge, X., Wu, Q., Jung, Y. C., Chen, J. & Wang, S. M.
A large quantity of novel human antisense transcripts
detected by LongSAGE. Bioinformatics 22,
2475–2479 (2006).
80. Henz, S. R. et al. Distinct expression patterns of
natural antisense transcripts in Arabidopsis. Plant
Physiol. 144, 1247–1255 (2007).81. Yoder-Himes, D. R. et al. Mapping the Burkholderia
cenocepacianiche response via high-throughput
sequencing. Proc. Natl Acad. Sci. USA 106,
3976–3981 (2009).
82. Perkins, T. T. et al. A strand-specific RNA–Seq analysis
of the transcriptome of the typhoid bacillus Salmonella
Typhi. PLoS Genet. 5, e1000569 (2009).
83. Sittka, A. et al. Deep sequencing analysis of small
noncoding RNA and mRNA targets of the global post-
transcriptional regulator, Hfq. PLoS Genet. 4,
e1000163 (2008).
84. Passalacqua, K. D. et al. Structure and complexity of a
bacterial transcriptome. J. Bacteriol. 191, 3203–3211
(2009).
85. Denoeud, F. et al. Prominent use of distal 5′
transcription start sites and discovery of a large
number of additional exons in ENCODE regions.
Genome Res. 17, 746–759 (2007).
R E V I E W S
668 | SEPTEMBER 2011 | VOLUME 9 www.nature.com/reviews/micro
© 2011 Macmillan Publishers Limited. All rights reserved
8/3/2019 Bacterial Transcriptomics What is Beyond the RNA Horiz-ome
http://slidepdf.com/reader/full/bacterial-transcriptomics-what-is-beyond-the-rna-horiz-ome 12/12
86. Schluter, J. P. et al. A genome-wide survey of sRNAs in
the symbiotic nitrogen fixing alpha-proteobacteriumSinorhizobium meliloti . BMC Genomics 11, 245 (2010).
87. Wassarman, K. M. & Saecker, R. M. Synthesis-
mediated release of a small RNA inhibitor of RNA
polymerase. Science 314, 1601–1603 (2006).88. Novick, R. P. & Geisinger, E. Quorum sensing in
staphylococci. Annu. Rev. Genet. 42, 541–564 (2008).89. Vanderpool, C. K. & Gottesman, S. Involvement of a
novel transcriptional activator and small RNA in post-
transcriptional regulation of the glucose
phosphoenolpyruvate phosphotransferase system.Mol. Microbiol. 54, 1076–1089 (2004).
90. Wadler, C. S. & Vanderpool, C. K. A dual function for a
bacterial small RNA: SgrS performs base pairing-
dependent regulation and encodes a functional
polypeptide. Proc. Natl Acad. Sci. USA 104,
20454–20459 (2007).91. Maki, K., Morita, T., Otaka, H. & Aiba, H. A minimal
base-pairing region of a bacterial small RNA SgrS
required for translational repression of ptsG mRNA.Mol. Microbiol. 76, 782–792 (2010).
92. Bouamrani, A. et al. Mesoporous silica chips for
selective enrichment and stabilization of low molecular
weight proteome. Proteomics 10, 496–505 (2010).93. Chang, S. J., Hsieh, S. Y., Yuan, H. S. & Chak, K. F.
Characterization of the specific cleavage of ceiE7-mRNA
of the bactericidal ColE7 operon. Biochem. Biophys.
Res. Commun. 299, 613–620 (2002).
94. Anderson, K. L. & Dunman, P. M. Messenger RNA
turnover processes in Escherichia coli , Bacillus
subtilis, and emerging studies in Staphylococcus
aureus. Int. J. Microbiol. 2009, 525491 (2009).95. Deana, A., Celesnik, H. & Belasco, J. G. The bacterial
enzyme RppH triggers messenger RNA degradation
by 5′ pyrophosphate removal. Nature 451, 355–358
(2008).96. Richards, J., Sundermeier, T., Svetlanov, A. & Karzai,
A. W. Quality control of bacterial mRNA decoding and
decay. Biochim. Biophys. Acta 1779, 574–582 (2008).97. Kime, L., Jourdan, S. S., Stead, J. A., Hidalgo-Sastre, A.
& McDowall, K. J. Rapid cleavage of RNA by RNase E
in the absence of 5′ monophosphate stimulation. Mol.
Microbiol. 76, 590–604 (2010).98. Mohanty, B. K. & Kushner, S. R. Bacterial/archaeal/
organellar polyadenylation. WIREs RNA 2, 256–276
(2010).99. Steege, D. A. Emerging features of mRNA decay in
bacteria. RNA 6, 1079–1090 (2000).
100. Gopalakrishna, Y., Langley, D. & Sarkar, N. Detection
of high levels of polyadenylate-containing RNA in
bacteria by the use of a single-step RNA isolation
procedure. Nucleic Acids Res. 9, 3545–3554 (1981).101. Mohanty, B. K. & Kushner, S. R. The majority of Escherichia coli mRNAs undergo post-transcriptional
modification in exponentially growing cells. Nucleic
Acids Res. 34, 5695–5704 (2006).102. Jasiecki, J. & Wegrzyn, G. Growth-rate dependent
RNA polyadenylation in Escherichia coli . EMBO Rep.
4, 172–177 (2003).103. Bernstein, J. A., Khodursky, A. B., Lin, P. H., Lin-Chao, S.
& Cohen, S. N. Global analysis of mRNA decay and
abundance in Escherichia coli at single-gene resolution
using two-color fluorescent DNA microarrays. Proc.
Natl Acad. Sci. USA 99, 9697–9702 (2002).The first global analysis of RNA decay in bacteria.
104. Li, Z., Reimers, S., Pandit, S. & Deutscher, M. P. RNA
quality control: degradation of defective transfer RNA.
EMBO J. 21, 1132–1138 (2002).
105. Schmitz-Linneweber, C. et al. A pentatricopeptide
repeat protein facilitates the trans-splicing of the
maize chloroplast rps12 pre-mRNA. Plant Cell 18,
2650–2663 (2006).
106. Seiwert, S. D. & Stuart, K. RNA editing: transfer of genetic information from gRNA to precursor mRNA
in vitro. Science 266, 114–117 (1994).107. Belfort, M. Self-splicing introns in prokaryotes:
migrant fossils? Cell 64, 9–11 (1991).108. Chu, F. K., Maley, G. F., Maley, F. & Belfort, M.
Intervening sequence in the thymidylate synthase
gene of bacteriophage T4. Proc. Natl Acad. Sci. USA
81, 3049–3053 (1984).109. Toro, N. Bacteria and Archaea Group II introns:
additional mobile genetic elements in the
environment. Environ. Microbiol. 5, 143–151 (2003).110. Krishnan, K. M., Van Etten, W. J. & Janssen, G. R.
Proximity of the start codon to a leaderless mRNA’s
5′ terminus is a strong positive determinant of
ribosome binding and expression in Escherichia coli . J. Bacteriol. 192, 6482–6485.
111. Brock, J. E., Pourshahian, S., Giliberti, J., Limbach,
P. A. & Janssen, G. R. Ribosomes bind leaderless
mRNA in Escherichia coli through recognition of their
5′-terminal AUG. RNA 14, 2159–2169 (2008).112. O’Donnell, S. M. & Janssen, G. R. Leaderless mRNAs
bind 70S ribosomes more strongly than 30S
ribosomal subunits in Escherichia coli . J. Bacteriol.
184, 6730–6733 (2002).113. Moll, I., Hirokawa, G., Kiel, M. C., Kaji, A. & Blasi, U.
Translation initiation with 70S ribosomes: an
alternative pathway for leaderless mRNAs. Nucleic
Acids Res. 32, 3354–3363 (2004).
114. Grill, S., Gualerzi, C. O., Londei, P. & Blasi, U.
Selective stimulation of translation of leaderless
mRNA by initiation factor 2: evolutionary implications
for translation. EMBO J. 19, 4101–4110 (2000).115. Yanofsky, C. Attenuation in the control of expression of
bacterial operons. Nature 289, 751–758 (1981).
116. Fuchs, R. T., Grundy, F. J. & Henkin, T. M. The SMK
box
is a new SAM-binding RNA for translational regulation
of SAM synthetase. Nature Struct. Mol. Biol. 13,
226–233 (2006).117. Fuchs, R. T., Grundy, F. J. & Henkin, T. M.
S -adenosylmethionine directly inhibits binding of 30S
ribosomal subunits to the SMK box translational
riboswitch RNA. Proc. Natl Acad. Sci. USA 104,
4876–4880 (2007).118. Smith, A. M., Fuchs, R. T., Grundy, F. J. & Henkin, T. M.
The SAM-responsive SMK
box is a reversible riboswitch.Mol. Microbiol. 78, 1393–1402 (2010).
119. Kertesz, M. et al. Genome-wide measurement of RNA
secondary structure in yeast. Nature 467, 103–107
(2010).120. Underwood, J. G. et al. FragSeq: transcriptome-wide
RNA structure probing using high-throughputsequencing. Nature Methods 7, 995–1001 (2010).References 119 and 120 describe promisingmethods for determining RNA structure in bacteria.
121. Postow, L., Hardy, C. D., Arsuaga, J. & Cozzarelli, N. R.
Topological domain structure of the Escherichia coli
chromosome. Genes Dev. 18, 1766–1779 (2004).122. Gitai, Z., Thanbichler, M. & Shapiro, L. The
choreographed dynamics of bacterial chromosomes.Trends Microbiol. 13, 221–228 (2005).
123. Dillon, S. C. & Dorman, C. J. Bacterial nucleoid-
associated proteins, nucleoid structure and gene
expression. Nature Rev. Microbiol. 8, 185–195 (2010).124. Peter, B. J. et al. Genomic transcriptional response to
loss of chromosomal supercoiling in Escherichia coli .Genome Biol. 5, R87 (2004).
125. Vora, T., Hottes, A. K. & Tavazoie, S. Protein occupancy
landscape of a bacterial genome. Mol. Cell 35,
247–253 (2009).126. French, S. L. & Mil ler, O. L. Jr. Transcription mapping
of the Escherichia coli chromosome by electronmicroscopy. J. Bacteriol. 171, 4207–4216 (1989).127. Binenbaum, Z., Parola, A. H., Zaritsky, A. & Fishov, I.
Transcription- and translation-dependent changes in
membrane dynamics in bacteria: testing the
transertion model for domain formation. Mol.
Microbiol. 32, 1173–1182 (1999).128. Deng, S., Stein, R. A. & Higgins, N. P. Organization of
supercoil domains and their reorganization by
transcription. Mol. Microbiol. 57, 1511–1521 (2005).
129. Dekker, J., Rippe, K., Dekker, M. & Kleckner, N.
Capturing chromosome conformation. Science 295,
1306–1311 (2002).130. Dostie, J. et al. Chromosome Conformation Capture
Carbon Copy (5C): a massively parallel solution for
mapping interactions between genomic elements.
Genome Res. 16, 1299–1309 (2006).
131. van Berkum, N. L. et al. Hi-C: a method to study the
three-dimensional architecture of genomes. http://
www.jove.com/details.php?id=1869 doi:
10.3791/1869. J. Vis. Exp. 39 (2010).
132. Vassetzky, Y. et al. Chromosome conformation capture(from 3C to 5C) and its ChIP-based modification.
Methods Mol. Biol. 567, 171–188 (2009).133. Tolhuis, B., Palstra, R. J., Splinter, E., Grosveld, F. & de
Laat, W. Looping and interaction between
hypersensitive sites in the active β-globin locus. Mol.
Cell 10, 1453–1465 (2002).
134. Dhar, S. S. & Wong-Riley, M. T. Chromosome
conformation capture of transcriptional interactions
between cytochrome c oxidase genes and genes of
glutamatergic synaptic transmission in neurons. J. Neurochem. 115, 676–683 (2010).
135. Duan, Z. et al. A three-dimensional model of the yeast
genome. Nature 465, 363–367 (2010).
136. Casadesus, J. & Low, D. Epigenetic gene regulation in
the bacterial world. Microbiol. Mol. Biol. Rev. 70,
830–856 (2006).137. Sohanpal, B. K., El-Labany, S., Lahooti, M.,
Plumbridge, J. A. & Blomfield, I. C. Integrated
regulatory responses of fimB to N -acetylneuraminic
(sialic) acid and GlcNAc in Escherichia coli K-12. Proc.
Natl Acad. Sci. USA 101, 16322–16327 (2004).
138. Blyn, L. B., Braaten, B. A. & Low, D. A. Regulation of
pap pilin phase variation by a mechanism involving
differential dam methylation states. EMBO J. 9,
4045–4054 (1990).139. Turner, K. H., Vallet-Gely, I. & Dove, S. L. Epigenetic
control of virulence gene expression in Pseudomonas
aeruginosaby a LysR-type transcription regulator.
PLoS Genet. 5, e1000779 (2009).
140. Flusberg, B. A. et al. Direct detection of DNA
methylation during single-molecule, real-time
sequencing. Nature Methods 7, 461–465 (2010).141. Shapiro, L. & Losick, R. Protein localization and cell
fate in bacteria. Science 276, 712–718 (1997).
142. Golding, I. & Cox, E. C. RNA dynamics in live
Escherichia coli cells. Proc. Natl Acad. Sci. USA 101,
11310–11315 (2004).143. Llopis, P. M. et al. Spatial organization of the flow of
genetic information in bacteria. Nature 466, 77–81
(2010).144. Schoenfelder, S. et al. Preferential associations between
co-regulated genes reveal a transcriptional interactome
in erythroid cells. Nature Genet. 42, 53–61 (2010).145. Nevo-Dinur, K., Nussbaum-Shochat, A., Ben-Yehuda, S.
& Amster-Choder, O. Translation-independent
localization of mRNA in E. coli . Science 331,
1081–1084 (2011).
146. Evguenieva-Hackenberg, E., Roppelt, V., Lassek, C. &
Klug, G. Subcellular localization of RNA degrading
proteins and protein complexes in prokaryotes. RNA
Biol. 8, 49–54 (2011).147. Femino, A. M., Fay, F. S., Fogarty, K. & Singer, R. H.
Visualization of single RNA transcripts in situ. Science
280, 585–590 (1998).
148. Santangelo, P. J. et al. Single molecule-sensitive
probes for imaging RNA in live cells. Nature Methods
6, 347–349 (2009).149. Taniguchi, Y. et al. Quantifying E. coli proteome and
transcriptome with single molecule sensitivity in single
cells. Science 329, 533–538 (2010).
150. Elf, J., Li, G. W. & Xie, X. S. Probing transcription
factor dynamics at the single molecule level in a living
cell. Science 316, 1191–1194 (2007).151. Suel, G. M., Garcia-Ojalvo, J., Liberman, L. M. &
Elowitz, M. B. An excitable gene regulatory circuit
induces transient cellular differentiation. Nature 440,
545–550 (2006).
152. Tang, F. et al. mRNA-Seq whole-transcriptome analysis
of a single cell. Nature Methods 6, 377–382 (2009).153. Hardy, C. D. & Cozzarelli, N. R. A genetic selection for
supercoiling mutants of Escherichia coli revealsproteins implicated in chromosome structure. Mol.
Microbiol. 57, 1636–1652 (2005).
154. Flicek, P. & Birney, E. Sense from sequence reads:
methods for alignment and assembly. Nature Methods
6, S6–S12 (2009).155. Li, R., Li, Y., Kristiansen, K. & Wang, J. SOAP: short
oligonucleotide alignment program. Bioinformatics
24, 713–714 (2008).
156. Li, H., Ruan, J. & Durbin, R. Mapping short DNA
sequencing reads and calling variants using mapping
quality scores. Genome Res. 18, 1851–1858 (2008).157. Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L.
Ultrafast and memory efficient alignment of short
DNA sequences to the human genome. Genome Biol.
10, R25 (2009).
158. Li, H. & Durbin, R. Fast and accurate short read
alignment with Burrows-Wheeler transform.Bioinformatics 25, 1754–1760 (2009).
159. Li, R. et al. SOAP2: an improved ultrafast tool for short
read alignment. Bioinformatics 25, 1966–1967 (2009).
160. Kang, Y. et al. Transcript amplification from singlebacterium for transcriptome analysis, Genome Res.
32, 925–935 (2011).
AcknowledgementsWe would like to thank M. Isalan and B. Lehner for critical
reading of the manuscript. This work was supported by the
Consolider programme of the Spanish Ministry of Research,
the Fundación Marcelino Botín and the European Research
Council.
Competing interests statementThe authors declare no competing financial interests.
FURTHER INFORMATIONAuthors’ homepage: http://serrano.crg.es
Bioconductor: http://www.bioconductor.org
ALL LINKS ARE ACTIVE IN THE ONLINE PDF
R E V I E W S
NATURE REVIEWS | MICROBIOLOGY VOLUME 9 | SEPTEMBER 2011 | 669