51
Sponsored by: Participating Experts: Daniel Turner, Ph.D. Wellcome Trust Sanger Institute, Cambridge, UK Webinar Series Webinar Series Science Science DNA Target DNA Target 10 June, 2009 10 June, 2009 Brought to you by the Science/AAAS Business Office Kelly Frazer, Ph.D. Scripps Genomic Medicine San Diego, CA Enrichment Strategies Enrichment Strategies www.opengenomics.com/SureSelect

Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Sponsored by:

Participating Experts:

Daniel Turner, Ph.D.Wellcome Trust Sanger Institute,Cambridge, UK

Webinar SeriesWebinar SeriesScienceScienceDNA Target DNA Target 10 June, 200910 June, 2009

Brought to you by the Science/AAAS Business Office

Kelly Frazer, Ph.D.Scripps Genomic MedicineSan Diego, CA

Enrichment StrategiesEnrichment Strategies

www.opengenomics.com/SureSelect

Page 2: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Daniel J TurnerHead of Sequencing Technology Development

Wellcome Trust Sanger Institute

DNA Target EnrichmentStrategies – bringing efficiencies

to genome sequencing

Page 3: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target enrichment strategies

PCR

on array

in solution

Page 4: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target enrichment strategies

PCR

on array

in solution

• Design primers that are specific for the region of interest

• Amplify

• Sequence

Page 5: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

XR

1,438 samples

57 populations

Population sequencing of ACTN3

Page 6: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

The α-actinin-3 deficiency trade-off:

Compared to R577 homozygotes, R557X homozygotes have:

• lower muscle strength and mass

• reduced capacity for rapid energy generation

MacArthur et al. 2007. Nature Genetics 39:1261-1265MacArthur et al. 2008 Hum Mol Genet 17:1076-86

• increased endurance capacity

• increased fatigue recovery

• enhanced muscle metabolic efficiency

Page 7: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Acoustic shearing

96-well library prep

ACTN3 CTSF

25 kb

Quail et al. (2008) Nat. Methods 5, 1005-1010

SPRI bead clean-ups

Custom adapters and barcoded PCR primers

Sequencing Strategy

Page 8: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

lanes 1,3,5,7 lanes 2,6,8

Sequencing Strategy

Page 9: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Uniformity of coverage

0

5000

10000

15000

20000

25000

30000

35000

40000

2200 2250 2300 2350 2400 2450 2500 2550 2600 2650 2700 2750 2800 2850 2900 2950 3000 3050 3100 3150 3200 3250 3300 3350 3400 3450

0

5000

10000

15000

20000

25000

30000

11800 11900 12000 12100 12200 12300 12400 12500 12600 12700 12800 12900 13000 13100 13200 13300 13400

Fragment 2

Fragment 8

0

5000

10000

15000

20000

25000

15750 15850 15950 16050 16150 16250 16350 16450 16550 16650 16750 16850 16950 17050 17150 17250 17350 17450

Fragment 10

• Uniformity is governed by the accuracy of pooling

• 80% with a coverage within 2-fold range of the median

Page 10: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

• 99.9% accuracy for genotype calling

• 63 high-confidence SNPs identified, 27 of them novel and 23 rare.

• Analysis of non-European HapMap samples and HGDP samples ongoing.

Results

Page 11: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target enrichment strategies

PCR

on array

in solution

• Limit of 5–20 kb per PCR

• Difficult to multiplex, optimise and normalise

• Uses a lot of DNA

• Expensive if multiplexing

• But very effective

Page 12: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target enrichment strategies

PCR

on array

in solution

• Hybridise sample DNA to target-specific probes on a microarray

• Wash to remove background

• Elute

• Sequence

Page 13: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target enrichment strategies

PCR

on array

in solution

• Hybridise sample DNA to target-specific probes in solution

• Capture probe / target

• Wash to remove background

• Elute

• Sequence

Page 14: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

gDNA Fragmentation

Target size: 100-300bpTarget size: 100-400bp

• Shorter fragments hybridize more efficiently

• Optimized settings give tighter distribution of fragment sizes

Page 15: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Library purification

SPRI beads: easily automated

allow elution in a wider variety of buffers

Page 16: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

PCR and GC bias

Without PCR prior to hybridization

a. b.10 30 40 50 60

GC content (%)

0

80

60

40

20

0 20 10040 60 80

Percentile of unique sequence ordered by GC content

0 20 100806040

Percentile of unique sequence ordered by GC content

10 30 40 50 60 70

GC content (%)

0

80

60

40

20

With PCR prior to hybridization

Page 17: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

• Completeness: % of target bases covered by >= 1 sequence read

• Specificity: % of sequences mapping to target regions

• Uniformity: variation in coverage

Evaluation parameters

Page 18: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Completeness

On array ~ 98.6% of targeted bases

In solution ~ 99.5% of targeted bases

PCR =< 100% of targeted bases

Page 19: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Specificity

On array up to 70% on target

In solution up to 80% on target

PCR up to 100% on target

Page 20: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

On array 90% of CTR at 30x

In solution 95% of CTR at 30x

90%

95%

100%

0 10 20 30Coverage (-fold)

% o

f CTR

bas

es 14M7.5M6.8M6.5M6.2M5.8MArray 6.5M

Sequence uniformity

Page 21: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

%GC vs %Coverage

Page 22: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target enrichment strategies

PCR

on array

in solution

• enables large-scale projects, which would not be realistic with PCR

• Not easily scalable

• Requires expensive hardware

Page 23: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target enrichment strategies

PCR

on array

in solution

• enables large-scale projects, which would not be realistic with PCR

• Simple & relatively rapid to perform

• Scalable & easily automated

• Uses least DNA

• Requires expensivehardware

• No whole exome set available commercially

Page 24: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

AcknowledgementsLira MamanovaCarol Scott

Iwanka KozarewaDaniel MacArthurChris Tyler-SmithQasim AyubLiz Huckle

Alison CoffeyEleanor HowardAarno Palotie

Wellcome Trust Sanger Institute

Emily LeProustFred Ernani

Agilent Technologies

Tom AlbertHeike FieglerGreg McGuiness

Nimblegen

Page 25: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Sponsored by:

Participating Experts:

Daniel Turner, Ph.D.Wellcome Trust Sanger Institute,Cambridge, UK

Webinar SeriesWebinar SeriesScienceScienceDNA Target DNA Target 10 June, 200910 June, 2009

Brought to you by the Science/AAAS Business Office

Kelly Frazer, Ph.D.Scripps Genomic MedicineSan Diego, CA

Enrichment StrategiesEnrichment Strategies

www.opengenomics.com/SureSelect

Page 26: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Enrichment of sequencing targets from the human genome

Kelly A Frazer, PhDDirector, Genomic BiologyScripps Genomic Medicine

June 10, 2009

Page 27: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

genomic DNA

select regions

What is targeted sequencing?

Define sequence targets

Target enriched samples

Sequence

Page 28: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Next‐Gen Sequencing

• Low costs for generating raw, per nucleotide sequence, ($0.00001 per base).

• Best suited for generating large amounts of raw sequence data per sample, (109nucleotides per day).  

Still too costly and too low through‐put to perform whole‐genome sequencing for on many different DNA samples

Why perform targeted sequencing?

Page 29: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

To efficiently use current technologies for population‐based sequencing studies, it is necessary to enrich for specific loci in the human genome.

Page 30: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Population Sequence Studies 

• Sequence‐based association studies

Healthy elderly cohort versus individuals with age‐related diseases

• Functional annotation of genomic intervals

9p21 interval associated  with CAD and T2D

Page 31: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

• PCR – enriches target sequences with high specificity but difficult to scale

• Hybridization based methods – long oligonucleotides in solution allow for efficient capture of ~3.5 Mb of sequence targets

• Microdroplet PCR – encapsulation of PCR reactions allows for simultaneous amplification of ~4,000  targeted elements

Sample enrichment methods

Page 32: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Important parameters • Efficiency of assay design

– The fraction of targeted base pairs for which an assay can be designed

• Specificity of target enrichment– The fraction of high quality reads that map directly on the targeted sequences

• Coverage uniformity across targeted sequences– If coverage differs greatly then one has to sequence deeply to adequately cover underrepresented bases

• Reproducibility across technical replicates & samples

• Systematic allelic biases resulting in drop‐out effects– Errors of this nature result in high rates of incorrectly called heterozygous variant sites

Page 33: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Target Enrichment by Solution Hybridization

sdfsdfsdf Make Genomic DNA Fragment Libraries

Agilent Microarray 

‐ synthesis 120‐mer oligonucleotides

‐ convert to biotinylated RNA capture probes

‐ hybridization with DNA 

‐ capture and wash

‐ elution and PCR amplify

‐ sequence targeted sequences 

Page 34: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

3.6 Mb of Targeted Sequences

• 624 genes– 9,215 exons

– 4,886 evolutionarily conserved sequences (ECS) 

– total 3.2 Mb of sequence

• 3 Contiguous Regions– 9p21: 196 kb

– APOE: 100 kb

– 8q24.21: 125 kb

Page 35: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Probe design efficiency

(a)

(b)

genes

Repeat Mask

Probes

Chr9

CDKN2ACDKN2BAS

CDKN2BC9orf53

21950000 21960000 21970000 21980000 21990000 22000000 22010000 22020000 22030000 22040000 22050000 22060000 22070000 22080000 22090000 22100000 22110000 22120000 22130000

CDKN2A

CDKN2BASCDKN2B

21960000 21965000 21970000 21975000 21980000 21985000 21990000 21995000 22000000

FOXO1 gene

Repeat Mask

ECS Block

Probes

Chr13

ECS Signal

• 622 genes – CDS (97%)  UTR (88%)  ECS (86%)

• Three genomic intervals – 37% to 55%

Page 36: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Specificity of target enrichment 

38.6% map directly on target47.8% map on or near target (+/‐ 150 bp)

Percent of base pairs corresponding to filtered reads 

Page 37: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Coverage uniformity across targeted sequences

Normalized coverage – divided the observed coverage of each base by the mean coverage of all targeted bases

88.4% of all bases fell within ¼ to 4 times the mean coverage

98.3% of all bases covered by at least one read

Page 38: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Reproducibility of coverage

Technical replicates r2 ~0.95

Page 39: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Variant calling accuracycomparison to microarray genotypes

~ 4,100 SNPs

QS >= 30  detection rate = 93% concordance rate = 99.3% 

No systematic allelic biases

Page 40: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Solution hybridization‐based method is well suited for the enrichment of loci in the mega‐base‐pair scale from the human genome for population sequence studies

Page 41: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Microdroplet PCR Workflow

Primer library – up to 4000 different elements

Fragmented genomic DNA template

Page 42: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Primer design efficiency

• 47 genes – 435 exons– 29 from ENCODE intervals

– 8 TRP channel superfamily

– 11 deep venous thrombosis 

• 457 amplicons of varying sizes (119‐956 bp) and GC content (33‐74%)

Successfully design PCR assays for all exons

Page 43: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Specificity of target enrichment 

• 78% of filtered reads successfully mapped to a targeted amplicon

• Off target reads aligned across genome  in a random fashion ‐ suggesting that background sequence is due to non‐specific genomic DNA carryover rather then from off‐target amplification

Page 44: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Coverage uniformity across targeted sequences

Normalized coverage – divided the observed coverage of each base by the mean coverage of all targeted bases

89.6% of all bases fell within ¼ to 4 times the mean coverage

99.6% of all bases covered by at least one read

Only one ampliconcompletely failed

Page 45: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Reproducibility of coverage

Sample to sample r2 ~0.96

Page 46: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Variant calling accuracycomparison to microarray genotypes

~ 450 SNPs

QS >= 30  detection rate = 97.6% concordance rate = 99.1% 

Accuracy was similar in ENCODE versus non‐ENCODE interval variants and between samples of African and European ancestry  indicating that allelic biases are mimimal

Page 47: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

The microdroplet PCR process is extremely efficient with almost 100% of all primer pairs successful.  The data generated is well suited for performing population‐based sequence studies.

Page 48: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Selecting a method

• Study design– Known functional elements or entire intervals

– Total amount of targeted sequences

– Number of samples

• Sequencing Technology

Page 49: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

AcknowledgementsSTSI/Scripps Genomic Medicine

Ryan Tewhey

Kazu Nakano

Wendy Wang

Sarah Murray

Olivier Harismendy

Eric Topol

Page 50: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Sponsored by:

Participating Experts:

Daniel Turner, Ph.D.Wellcome Trust Sanger Institute,Cambridge, UK

Webinar SeriesWebinar SeriesScienceScienceDNA Target DNA Target 10 June, 200910 June, 2009

Brought to you by the Science/AAAS Business Office

Kelly Frazer, Ph.D.Scripps Genomic MedicineSan Diego, CA

Enrichment StrategiesEnrichment Strategies

www.opengenomics.com/SureSelect

Page 51: Science WWebinar Seriesebinar Series DNA Target … slides...sdfsdfsdf Make Genomic DNA Fragment Libraries Agilent Microarray ‐synthesis 120‐mer oligonucleotides ‐convert to

Look out for more webinars in the series at:

www.sciencemag.org/webinar

For related information on this webinar topic, go to:

www.opengenomics.com/SureSelect

To provide feedback on this webinar, please e‐mail

your comments to [email protected]

Sponsored by:

Webinar SeriesWebinar SeriesScienceScienceDNA Target DNA Target 10 June, 200910 June, 2009

Brought to you by the Science/AAAS Business Office

Enrichment StrategiesEnrichment Strategies