1
Next Generation Sequencing of Difficult to Lyse Bacterial and Archaeal Genomes: High Quantity or Quality DNA Not Required Rachel R. Spurbeck 1* , Sukhinder K. Sandhu 1 , Caitlin Miller 2 , Michael Manzella 2 , Kazem Kashefi 2 , Tim Harkins 1 , Laurie Kurihara 1 , and Vladimir Makarov 1 . 1 Swift Biosciences Inc., Ann Arbor, MI. 2 Michigan State University, East Lansing, MI. Library Percent Aligned Percent Duplication Estimated Library Size Accel-NGS 2S Bead Beating 99.5 0.32 1,007,885,012 Accel-NGS 1S Bead Beating 99.4 0.34 737,345,399 Accel-NGS 1S NaOH 98.8 1.11 135,488,806 Extraction Method Qubit® (ng/μl) Nanodrop™ (ng/µl) Bead Beating 3.1 5.5 NaOH < 2 107.3 Accel-NGS 2S Bead Beating Accel-NGS 1S Bead Beating Accel-NGS 1S NaOH Fold-Coverage: 58.6x 65.5x 52.9x Number of Contigs: 45 42 46 Total Consensus: 1894793 1896447 1892667 Largest Contig: 190927 190702 190844 N 50 Contig Size: 91965 85449 86622 Number of Subsystems: 290 292 291 Number of Coding Sequences: 1734 1730 1727 Number of RNAs: 58 62 60 Strain Hulk 4021 de novo Assembly Stats Library Method: Accel-NGS 2S Accel-NGS 1S Number of Reads Assembled: 1,943,313 1,946,669 Fold-Coverage: 84.14 72.99 Number of Contigs: 14 13 Total Consensus: 2,013,161 2,013,050 Largest Contig: 548,840 548,803 N50 Contig Size: 347,461 347,333 Number of Subsystems: 217 216 Number of Coding Sequences: 2,127 2,125 Number of RNAs: 50 50 www.swiftbiosci.com © 2015, Swift Biosciences, Inc. The Swift logo, Accel-NGS, and Adaptase are trademarks of Swift Biosciences. Accel-NGS is for Research Use Only. Illumina and TruSeq are trademarks of Illumina, Inc. Qubit and Ion Torrent are registered trademarks of Life Technologies, Inc. Nanodrop is a trademark of Thermo Fisher Scientific. 15-0297 05/15. Common lysis methods for DNA extraction for Next Generation Sequencing (NGS) can be ineffectual due to thick cell walls, S-layers, and other structures that protect microbes from extreme conditions causing insufficient or too low of quality DNA for NGS, resulting in poor sequencing data and incomplete genome coverage. Here we describe two distinct methods for NGS library preparation to successfully sequence difficult to lyse bacteria and archaea. The first method repairs both ends of fragmented double stranded DNA (dsDNA) allowing as little as 10 pg to be utilized. The second adapts single-stranded DNA (ssDNA), making it possible to sequence even after the use of harsh chemical and physical lysis methods. Harsh methods can maximize recovery but produce damaged DNA that typically has been difficult to use in NGS. To demonstrate that these methods enable sequencing of difficult to lyse organisms, DNA was isolated from three organisms that were lysed by three different methods: Eggerthella sp. HPP0013 and Facklamia sp. HGF4 were isolated by both bead beating and NaOH boiling, and Archaea str. Hulk 4021 by hot phenol extraction. NGS libraries were prepared by both the dsDNA and ssDNA methods from the DNA isolated by either bead beating or hot phenol. The DNA extracted by NaOH was denatured, and only the ssDNA library preparation method could be utilized. All Eggerthella libraries were sequenced and aligned to the Eggerthella sp. HGA1 with greater than 98% of reads aligning to the genome demonstrating that the ssDNA library prepared from the NaOH extracted DNA was a high quality and complex library. The Facklamia isolate was assembled de novo, with the number of contigs serving as a proxy for complete genome recovery. All assemblies had a consensus sequence of 1.89 Mbp, comprised of 42-46 contigs, with an average coverage of 53-66x demonstrating that the libraries were complex and comparable between the ds- and ssDNA methods. The DNA extracted from str. Hulk, which is an obligate Fe(III) oxide reducing archaeon, was contaminated with iron oxide. However, both the ds- and ssDNA library preparation methods produced libraries that were sequenced and assembled resulting in 10-14 contigs. These results demonstrate that these library preparation kits enable sequencing of organisms for which harsh extraction methods are necessary. Abstract Two Technologies for All Library Preparation Needs Fast, 2 hour turnaround Input range of 10 pg to 1 μg Compatible with either Illumina® or Ion Torrent Adapts single-stranded DNA Accel-NGS™ 1S Plus PCR-free libraries from 100 ng DNA input PCR-free libraries from 10 ng DNA input with library pooling Higher library complexity than TruSeq™ from 10X less input DNA Even sequencing coverage: minimal bias for AT- or GC-rich sequences Input range of 10 pg – 1 μg Accel-NGS 2S Many organisms have surface structures that prevent or inhibit gDNA extraction by conventional methods including: - cell wall - capsule - crystalline surface layer proteins - cellulose Mechanisms to extract gDNA that overcome these barriers can lead to denaturation of the DNA or fragmentation: - alkaline lysis methods such as NaOH boiling Breaking Down the Barriers to DNA: Harsh Extraction Methods for Difficult Samples Boiling with sodium hydroxide is fast (finished in under 1 hr), and inexpensive Boiling with NaOH breaks open more cells allowing for a higher yield of DNA DNA extracted by NaOH is denatured, so has to be prepared for sequencing with a ssDNA library preparation kit, such as Accel-NGS 1S DNA Library Kit. Use a Cheaper, Easier Extraction Method to Get More gDNA Out of Difficult Samples Accel-NGS 2S: bead beating Accel-NGS 1S: bead beating Accel-NGS 1S: NaOH ssDNA Library Prep Enables Sequencing of Denatured DNA from Harsh Extractions Methods : To determine if the NaOH boiling extraction method followed by Accel-NGS 1S library preparation could be used to sequence the genomes of organisms from which it is difficult to extract DNA, Eggerthella HGA1 was extracted by bead beating or NaOH boiling and libraries were prepared by Accel-NGS 2S and 1S. The libraries were then sequenced, aligned to the reference sequence, and Picard’s GCmetrics were analyzed. Conclusion : Accel-NGS 1S can be used to sequence DNA extracted by NaOH. 2S bead beating In subsystem Not in subsystem 1S bead beating In subsystem Not in subsystem 1S NaOH boiling In subsystem Not in subsystem Methods : To determine if the NaOH boiling extraction method followed by Accel-NGS 1S library preparation could be used to sequence the genomes of organisms from which it is difficult to extract DNA, Facklamia HGF4 was extracted by bead beating or NaOH boiling and libraries were prepared by Accel-NGS 2S and 1S. The libraries were then sequenced, reads were assembled de novo, and annotated using the RAST server. Conclusion : Accel-NGS 1S can be used to sequence difficult to extract microorganisms after NaOH extraction. NaOH boiling extraction coupled with Accel-NGS 1S enabled the sequencing of Facklamia HGF4 Accel-NGS 1S and 2S Produce Libraries Directly from Iron Contaminated Samples Methods : Accel-NGS 1S and Accel-NGS 2S were used to prepare libraries from an archaeon sample from the Hulk deep sea vent that could not be sequenced previously due to iron contaminating the extracted gDNA. The libraries were sequenced, the reads were assembled de novo using MIRA 4.0 and annotated using the RAST server. A) The subsystem breakdown of the genome as sequenced from the Accel-NGS 2S library. B) The subsystem breakdown of the genome as sequenced from the Accel-NGS 1S library. C) The iron contaminated sample compared to a gDNA sample without iron contamination. D) Comparison of the consensus genome assemblies constructed using the sequence data from libraries prepared by the Accel-NGS 1S and Accel-NGS 2S library preparation kits. Conclusion : Either Accel-NGS 1S or 2S can be used to directly prepare libraries for NGS whole genome sequencing from samples contaminated with iron without any extra DNA clean-up steps. Accel-NGS 1S is enabling for difficult to lyse samples: NaOH boiling extractions followed by Accel-NGS 1S library prep produced high quality sequencing data as demonstrated by re-sequencing and de novo assembly experiments. Accel-NGS 1S covers ALL DNA genomes: Bacteria, archaea and dsDNA viruses ssDNA viruses and phage Reveals more information from one metagenomic sample in a single prep: detect all species present without requiring WGA Accel-NGS 2S enables high quality WGS for samples that have extremely low DNA recovery (down to 10 pg) from gentle extraction methods. Accel-NGS products can adapt DNA containing iron contaminants. Conclusions C. A. Accel-NGS 2S B. Accel-NGS 1S Hulk by 1S vs Hulk by 2S D. Acknowledgment: Facklamia and Eggerthella DNA samples were provided by Dr. Tom Schmidt’s lab at the University of Michigan.

Next Generation Sequencing of Difficult to Lyse Bacterial ... · Next Generation Sequencing of Difficult to Lyse Bacterial and Archaeal Genomes: ... 50 50 ... • PCR-free libraries

  • Upload
    dobao

  • View
    215

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Next Generation Sequencing of Difficult to Lyse Bacterial ... · Next Generation Sequencing of Difficult to Lyse Bacterial and Archaeal Genomes: ... 50 50 ... • PCR-free libraries

Next Generation Sequencing of Difficult to Lyse Bacterial and Archaeal Genomes: High Quantity or Quality DNA Not RequiredRachel R. Spurbeck1*, Sukhinder K. Sandhu1, Caitlin Miller2, Michael Manzella2, Kazem Kashefi2, Tim Harkins1, Laurie Kurihara1, and Vladimir Makarov1.

1Swift Biosciences Inc., Ann Arbor, MI. 2Michigan State University, East Lansing, MI.

Library Percent Aligned Percent Duplication Estimated Library SizeAccel-NGS 2S Bead Beating 99.5 0.32 1,007,885,012Accel-NGS 1S Bead Beating 99.4 0.34 737,345,399

Accel-NGS 1S NaOH 98.8 1.11 135,488,806

Extraction Method Qubit® (ng/µl) Nanodrop™ (ng/µl)

Bead Beating 3.1 5.5

NaOH < 2 107.3

Accel-NGS 2S Bead Beating

Accel-NGS 1S Bead Beating

Accel-NGS 1S NaOH

Fold-Coverage: 58.6x 65.5x 52.9xNumber of Contigs: 45 42 46Total Consensus: 1894793 1896447 1892667

Largest Contig: 190927 190702 190844

N50 Contig Size: 91965 85449 86622

Number of Subsystems:

290 292 291

Number of Coding Sequences:

1734 1730 1727

Number of RNAs: 58 62 60

Strain Hulk 4021 de novo Assembly Stats

Library Method: Accel-NGS 2S Accel-NGS 1S

Number of Reads Assembled: 1,943,313 1,946,669

Fold-Coverage: 84.14 72.99

Number of Contigs: 14 13

Total Consensus: 2,013,161 2,013,050

Largest Contig: 548,840 548,803

N50 Contig Size: 347,461 347,333

Number of Subsystems: 217 216

Number of Coding Sequences: 2,127 2,125

Number of RNAs: 50 50

www.swiftbiosci.com © 2015, Swift Biosciences, Inc. The Swift logo, Accel-NGS, and Adaptase are trademarks of Swift Biosciences. Accel-NGS is for Research Use Only. Illumina and TruSeq are trademarks of Illumina, Inc. Qubit and Ion Torrent are registered trademarks of Life Technologies, Inc. Nanodrop is a trademark of Thermo Fisher Scientific. 15-0297 05/15.

Common lysis methods for DNA extraction for Next Generation Sequencing (NGS) can be ineffectual dueto thick cell walls, S-layers, and other structures that protect microbes from extreme conditions causinginsufficient or too low of quality DNA for NGS, resulting in poor sequencing data and incomplete genomecoverage. Here we describe two distinct methods for NGS library preparation to successfully sequencedifficult to lyse bacteria and archaea.

The first method repairs both ends of fragmented double stranded DNA (dsDNA) allowing as little as 10pg to be utilized. The second adapts single-stranded DNA (ssDNA), making it possible to sequence evenafter the use of harsh chemical and physical lysis methods. Harsh methods can maximize recovery butproduce damaged DNA that typically has been difficult to use in NGS.

To demonstrate that these methods enable sequencing of difficult to lyse organisms, DNA was isolatedfrom three organisms that were lysed by three different methods: Eggerthella sp. HPP0013 andFacklamia sp. HGF4 were isolated by both bead beating and NaOH boiling, and Archaea str. Hulk 4021 byhot phenol extraction. NGS libraries were prepared by both the dsDNA and ssDNA methods from theDNA isolated by either bead beating or hot phenol. The DNA extracted by NaOH was denatured, andonly the ssDNA library preparation method could be utilized. All Eggerthella libraries were sequencedand aligned to the Eggerthella sp. HGA1 with greater than 98% of reads aligning to the genomedemonstrating that the ssDNA library prepared from the NaOH extracted DNA was a high quality andcomplex library. The Facklamia isolate was assembled de novo, with the number of contigs serving as aproxy for complete genome recovery. All assemblies had a consensus sequence of 1.89 Mbp, comprisedof 42-46 contigs, with an average coverage of 53-66x demonstrating that the libraries were complex andcomparable between the ds- and ssDNA methods. The DNA extracted from str. Hulk, which is an obligateFe(III) oxide reducing archaeon, was contaminated with iron oxide. However, both the ds- and ssDNAlibrary preparation methods produced libraries that were sequenced and assembled resulting in 10-14contigs. These results demonstrate that these library preparation kits enable sequencing of organisms forwhich harsh extraction methods are necessary.

Abstract

Two Technologies for All Library Preparation Needs

• Fast, 2 hour turnaround

• Input range of 10 pg to 1 μg

• Compatible with either Illumina® or Ion Torrent

• Adapts single-stranded DNA

Accel-NGS™ 1S Plus

• PCR-free libraries from 100 ng DNA input

• PCR-free libraries from 10 ng DNA input with library pooling

• Higher library complexity than TruSeq™ from 10X less input DNA

• Even sequencing coverage: minimal bias for AT- or GC-rich sequences

• Input range of 10 pg – 1 µg

Accel-NGS 2S

Many organisms have surface structures that prevent or inhibit gDNA extraction by conventional methods including:

- cell wall- capsule- crystalline surface layer proteins- cellulose

Mechanisms to extract gDNA that overcome these barriers can lead to denaturation of the DNA or fragmentation:

- alkaline lysis methods such as NaOH boiling

Breaking Down the Barriers to DNA: Harsh Extraction Methods for Difficult Samples

•Boiling with sodium hydroxide is fast (finished in under 1 hr), and inexpensive

•Boiling with NaOH breaks open more cells allowing for a higher yield of DNA

•DNA extracted by NaOH is denatured, so has to be prepared for sequencing with a ssDNA library preparation kit, such as Accel-NGS 1S DNA Library Kit.

Use a Cheaper, Easier Extraction Method to Get More gDNA Out of Difficult Samples

Accel-NGS 2S: bead beating Accel-NGS 1S: bead beating Accel-NGS 1S: NaOH

ssDNA Library Prep Enables Sequencing of Denatured DNA from Harsh Extractions

Methods: To determine if the NaOH boiling extraction method followed by Accel-NGS 1S library preparation could beused to sequence the genomes of organisms from which it is difficult to extract DNA, Eggerthella HGA1 wasextracted by bead beating or NaOH boiling and libraries were prepared by Accel-NGS 2S and 1S. The libraries werethen sequenced, aligned to the reference sequence, and Picard’s GCmetrics were analyzed.Conclusion: Accel-NGS 1S can be used to sequence DNA extracted by NaOH.

2S bead beating

In subsystemNot in subsystem

1S bead beating

In subsystemNot in subsystem

1S NaOH boiling

In subsystemNot in subsystem

Methods: To determine if the NaOH boiling extraction method followed by Accel-NGS 1S library preparation could beused to sequence the genomes of organisms from which it is difficult to extract DNA, Facklamia HGF4 was extracted bybead beating or NaOH boiling and libraries were prepared by Accel-NGS 2S and 1S. The libraries were then sequenced,reads were assembled de novo, and annotated using the RAST server.Conclusion: Accel-NGS 1S can be used to sequence difficult to extract microorganisms after NaOH extraction.

NaOH boiling extraction coupled with Accel-NGS 1S enabled the sequencing of Facklamia HGF4

Accel-NGS 1S and 2S Produce Libraries Directly from Iron Contaminated Samples

Methods: Accel-NGS 1S and Accel-NGS 2S were used to prepare libraries from an archaeon samplefrom the Hulk deep sea vent that could not be sequenced previously due to iron contaminating theextracted gDNA. The libraries were sequenced, the reads were assembled de novo using MIRA 4.0and annotated using the RAST server. A) The subsystem breakdown of the genome as sequencedfrom the Accel-NGS 2S library. B) The subsystem breakdown of the genome as sequenced from theAccel-NGS 1S library. C) The iron contaminated sample compared to a gDNA sample without ironcontamination. D) Comparison of the consensus genome assemblies constructed using thesequence data from libraries prepared by the Accel-NGS 1S and Accel-NGS 2S library preparationkits.Conclusion: Either Accel-NGS 1S or 2S can be used to directly prepare libraries for NGS wholegenome sequencing from samples contaminated with iron without any extra DNA clean-up steps.

Accel-NGS 1S is enabling for difficult to lyse samples:• NaOH boiling extractions followed by Accel-NGS 1S library prep produced high quality

sequencing data as demonstrated by re-sequencing and de novo assembly experiments.

Accel-NGS 1S covers ALL DNA genomes:• Bacteria, archaea and dsDNA viruses• ssDNA viruses and phage• Reveals more information from one metagenomic sample in a single prep: detect all

species present without requiring WGA

Accel-NGS 2S enables high quality WGS for samples that have extremely lowDNA recovery (down to 10 pg) from gentle extraction methods.

Accel-NGS products can adapt DNA containing iron contaminants.

Conclusions

C.

A. Accel-NGS 2S B. Accel-NGS 1S

Hulk by 1S vs Hulk by 2S

D.

Acknowledgment: Facklamia and Eggerthella DNA samples were provided by Dr. Tom Schmidt’s lab at the University of Michigan.