A brief introduction to amplicon sequencing of the 16S rRNA gene for the analysis of microbial diversity. This talk was originally presented at the workshop Introduction to Systems Biology, Aalborg, Denmark, 2013-10-29.
Introduction To Community Systems Microbiology, Aalborg 2013
Amplicon Sequencing
Aaron Marc Saunders
Bacterial diversity
cultured representative
Phylum Actinobacteria
Genus Tetrasphaera
Environmental diversity
Environmental sample mixed genomic DNA
DNA extraction
Environmental surveys
● Marker gene sequencing
– PCR product
– One gene fragment
● Metagenomics
– Total DNA
– All genes
Marker Genes
http://fungene.cme.msu.edu/
”Tree of Life”
Based on 16S rRNA gene
Start
Stop
Hypervariable regions
E. coli
V4
All life!
16S rRNA gene conservation
Regions denoted ”hypervariable” http://www.bioinformatics-toolkit.org
Environmental surveys
Environmental sample
mixed genomic DNA
PCR product
16S clone library
Environmental surveys
Environmental sample
mixed genomic DNA
metagenomic library
PCR product
amplicon library
NGS sequencing
Amplicon vs. Metagenomics
– less complex
  ● better coverage
  ● more samples
– extensive database
– same fragment
  ● comparable phylogenetic info
– PCR bias
– Limited phylogenetic info (genus/species level)
– Limited functional information
Typical results
Small number of highly abundant species and a long tail of rare species
2. Methodology
16S rRNA amplicon sequencing
Sample
PCR
”Short” PCR product (70-400 bp)
Next-gen sequencing
Sequence reads
Quality screen and clustering
Sequence types (OTUs)
Matching to existing database
Classified OTUs
Environmental surveys
Environmental sample
mixed genomic DNA
PCR product
amplicon library
DNA extraction
Hypervariable regions
E. coli
V4
All life!
PCR product
Examples of NGS sequenced amplicons
Regions denoted ”hypervariable” http://www.bioinformatics-toolkit.org
454 = 400 bp
Illumina = 250 bp
Multiplexing
Normal PCR product
NGS-adapted PCR product
Barcoded PCR product for NGS
Multiplexing
          adaptor    barcode  Primer sequence      Amplified sequence
Sample 1  …GCCATCAG  GATCT    CNACGCGAAGAACCTTANC  NNNNNNNNNN…
Sample 2  …GCCATCAG  ATCAG    CNACGCGAAGAACCTTANC  NNNNNNNNNN…
Sample 3  …GCCATCAG  CACTG    CNACGCGAAGAACCTTANC  NNNNNNNNNN…
Sample 4  …GCCATCAG  CTGTG    CNACGCGAAGAACCTTANC  NNNNNNNNNN…
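The barcode layout above can be used to sort reads back into samples after sequencing. The following is a minimal sketch, not the method of any particular pipeline. It assumes that each read contains the visible end of the adaptor (GCCATCAG) directly followed by a 5-base barcode, and that barcodes must match exactly. The function name `demultiplex` is hypothetical; the barcodes and sample names are taken from the slide.

```python
# Barcodes and sample names are taken from the slide. The read layout
# (adaptor end + 5-base barcode + primer + amplicon) and exact matching
# are assumptions made for this illustration.
ADAPTOR_END = "GCCATCAG"
BARCODE_LENGTH = 5
BARCODES = {
    "GATCT": "Sample 1",
    "ATCAG": "Sample 2",
    "CACTG": "Sample 3",
    "CTGTG": "Sample 4",
}


def demultiplex(reads):
    """Group reads by sample using the barcode that follows the adaptor."""
    by_sample = {name: [] for name in BARCODES.values()}
    unassigned = []
    for read in reads:
        start = read.find(ADAPTOR_END)
        if start == -1:
            # No adaptor found: the read cannot be assigned.
            unassigned.append(read)
            continue
        barcode_start = start + len(ADAPTOR_END)
        barcode_end = barcode_start + BARCODE_LENGTH
        sample = BARCODES.get(read[barcode_start:barcode_end])
        if sample is None:
            # Unknown or truncated barcode.
            unassigned.append(read)
            continue
        # Keep only what follows the barcode (primer + amplified sequence).
        by_sample[sample].append(read[barcode_end:])
    return by_sample, unassigned
```

Real demultiplexing tools usually also tolerate a small number of barcode mismatches; that is left out here to keep the sketch short.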
Bacterial Diversity
Clustering
Representative sequence
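One common way to obtain clusters with a representative sequence is greedy clustering by abundance. The sketch below is an illustration under stated assumptions, not the algorithm of a specific tool. The 97% default threshold is an assumption that does not appear in the slides. The `identity` function only compares equal-length sequences position by position, whereas real tools align the sequences first. Both function names are hypothetical.

```python
from collections import Counter


def identity(a, b):
    """Fraction of matching positions; a crude stand-in for alignment-based identity."""
    if not a or len(a) != len(b):
        return 0.0
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def greedy_cluster(reads, threshold=0.97):
    """Cluster reads greedily; abundant sequences become representatives first.

    Returns a dict mapping each representative sequence to the number of
    reads in its cluster.
    """
    counts = Counter(reads)
    clusters = {}
    # Most abundant first; ties are broken alphabetically for determinism.
    for seq, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        for rep in clusters:
            if identity(seq, rep) >= threshold:
                clusters[rep] += n
                break
        else:
            # No existing representative is similar enough: start a new cluster.
            clusters[seq] = n
    return clusters
```

Because abundant sequences are processed first, rare variants that differ by a few bases tend to be absorbed into the cluster of their abundant neighbour.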
Consider Errors!
• Low quality reads
• PCR errors
• Sequencing errors
– Particularly homopolymers
  ● 454 & Ion Torrent!
• Chimeras
Functional information
• Isolates
• Functional 16S rRNA work
– Stable isotope probing
• Metagenomics
• In situ studies
– Microscopy, e.g. inclusion bodies
– Microautoradiography
– Raman or NanoSIMS
Taxonomic assignment
Genus A
Genus B
Genus C
Family
Using a ”Classifier”
• Uses an existing phylogeny
• Find best unambiguous match to references
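One simple way to read "best unambiguous match" is to keep only the part of the taxonomy on which all equally good reference hits agree. The sketch below illustrates that idea and is not the algorithm of any specific classifier. The function name `unambiguous_assignment` and the example lineages are hypothetical.

```python
def unambiguous_assignment(hit_lineages):
    """Return the taxonomy shared by all best-matching references.

    Each lineage is a sequence of names ordered from the highest rank to
    the lowest, e.g. (class, order, family, genus). The result stops at
    the first rank where the references disagree.
    """
    if not hit_lineages:
        return []
    consensus = []
    # zip(*...) walks the lineages rank by rank.
    for names_at_rank in zip(*hit_lineages):
        if len(set(names_at_rank)) != 1:
            break
        consensus.append(names_at_rank[0])
    return consensus


# Hypothetical example: two equally good hits from different genera of
# the same family, so the read is only assigned to family level.
hits = [
    ("Class X", "Order X", "Family X", "Genus A"),
    ("Class X", "Order X", "Family X", "Genus B"),
]
print(unambiguous_assignment(hits))
```

When the hits disagree at genus level, the read is assigned to the family instead.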
Classification results
Accumulibacter
Other Bacteria
Radioactive acetate
Radioactive phosphate
Microautoradiography
Metagenomics
Simon McIlroy, Microthrix
ISME J 7 (6): 1161
Substrate specificity
                   Accumulibacter  Tetrasphaera  Competibacter  Defluvicoccus
acetate            +               -             +              +
propionate         +               -             -              +
Casamino acids     Glu only        +             -              -
Nitrate reduction  +/-             +/-           +/-            -
Determined by FISH-MAR
Greengenes assignment
species (OTU) Class Order Family Genus
OTU1 Betaproteobacteria Nitrosomonadales Nitrosomonadaceae Nitrosomonas
OTU2 Nitrospira Nitrospirales Nitrospiraceae Nitrospira
OTU3 Betaproteobacteria Rhodocyclales Rhodocyclaceae Propionivibrio
OTU4 Betaproteobacteria Gallionellales Gallionellaceae ??
OTU5 Gammaproteobacteria ?? ?? ??
midasfieldguide.org
52 core genera
64% organisms DK
41% globally
+ others
MIDAS curated assignment
species (OTU) Class Order Family Genus
OTU1 Betaproteobacteria Nitrosomonadales Nitrosomonadaceae Nitrosomonas
OTU2 Nitrospira Nitrospirales Nitrospiraceae Nitrospira
OTU3 Betaproteobacteria Rhodocyclales Rhodocyclaceae Accumulibacter
OTU4 Betaproteobacteria Gallionellales Gallionellaceae Nitrotoga
OTU5 Gammaproteobacteria Competibacterales Competibacteraceae Competibacter
Nitrite-oxidisers
(Figure: percent abundance by genus, y-axis from 0 to 0.3)
Analysis Tools
● Qiime (qiime.org)
● Mothur (mothur.org)
● R (r-project.org) – Phyloseq package (http://joey711.github.io/phyloseq/)
Analysis Example
● http://nbviewer.ipython.org/7213441
Comparing bacterial diversity of wastewater and activated sludge
Location Sample type Replicate
Aalborg East activated sludge AAE-1
Aalborg East activated sludge AAE-2
Aalborg East wastewater AAE-1
Aalborg East wastewater AAE-2
Aalborg West activated sludge AAW-1
Aalborg West activated sludge AAW-2
Aalborg West wastewater AAW-1
Aalborg West wastewater AAW-2
Hjoerring activated sludge HJO-1
Hjoerring activated sludge HJO-2
Hjoerring wastewater HJO-1
Hjoerring wastewater HJO-2
Phylum level Classification
AS WW
Proteobacteria
Firmicutes
Bacteroidetes
Genus level Classification
AS WW
Alpha Diversity
• How diverse is the community in my sample?
• Richness
– How many organisms are in my sample?
• Evenness
– How evenly are they distributed?
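Richness and evenness can both be computed directly from the OTU counts of a single sample. The sketch below uses observed richness, the Shannon index and Pielou's evenness. These specific metrics are an assumption, since the slides do not say which ones were used, and the function names are hypothetical.

```python
import math


def richness(counts):
    """Observed richness: number of OTUs seen at least once."""
    return sum(1 for c in counts if c > 0)


def shannon(counts):
    """Shannon index H' = -sum(p_i * ln(p_i)) over the observed OTUs."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def pielou_evenness(counts):
    """Pielou's J' = H' / ln(S); values near 1 mean OTUs are equally abundant."""
    s = richness(counts)
    if s < 2:
        # Evenness is not defined for fewer than two OTUs; return 0 by convention here.
        return 0.0
    return shannon(counts) / math.log(s)


# Two hypothetical samples with the same richness but different evenness.
even_sample = [25, 25, 25, 25]
skewed_sample = [97, 1, 1, 1]
print(richness(even_sample), pielou_evenness(even_sample))
print(richness(skewed_sample), pielou_evenness(skewed_sample))
```

Both samples contain four OTUs, but the skewed sample, with one dominant OTU and a tail of rare ones, has a much lower evenness.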
Richness estimates
Beta Diversity
• How similar/different is my sample from other samples?
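Similarity between samples is usually expressed as a pairwise distance. The sketch below uses Bray-Curtis dissimilarity on OTU counts. That choice of metric is an assumption for illustration, since the slides do not name one, and the function name and example counts are hypothetical.

```python
def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two samples given as {OTU: count} dicts.

    0 means identical composition, 1 means no shared OTUs.
    """
    otus = set(a) | set(b)
    # Sum of the lesser count for every OTU, i.e. the shared abundance.
    shared = sum(min(a.get(otu, 0), b.get(otu, 0)) for otu in otus)
    total = sum(a.values()) + sum(b.values())
    if total == 0:
        return 0.0
    return 1 - 2 * shared / total


# Hypothetical counts for two samples.
sample_1 = {"OTU1": 6, "OTU2": 4}
sample_2 = {"OTU1": 2, "OTU2": 8}
print(bray_curtis(sample_1, sample_2))
```

A matrix of such pairwise values is the typical input for clustering and ordination of samples.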
Clustering
Principal components analysis
Label with environmental data
Amplicon Sequencing
• Pros:
– ”cheap”, many samples
– 16S: good database for classification
• Cons:
– PCR and primer bias
– limited functional information