2
Complete Genome Sequence of Rahel, a C1 Cluster Mycobacteriophage Fernando E. Nieto-Fernandez, a Christos Noutsos, a Jillian Nissen, a Yara Abdelsalam, a Jessica Ackloo, a Navpreet Banger, a Hason Chan, a Tarana Chittineedi, a Isabel Duplessy, a Mark Dyce, a Daesha Garrison, a Jaime Gonzalez, a Sandra John, a Imanjot Kahlon, a Tania Kumar, a Adwoa Lewis, a Karthik Madhira, a Rivka Mullokandova, a Nellie Pirzadeh, a Iman Raja, a Kevin Ram, a Ravi Ramdhari, a Rita Reddy, a Briana S. Saed, a Phalan Smith, a Steeve Sproul, a Jane Thomas, a Avia Yosse, a Jorge Morales b a Department of Biological Sciences, SUNY Old Westbury, Old Westbury, New York, USA b The City College of New York, New York, New York, USA ABSTRACT Rahel is a lytic Myoviridae bacteriophage that infects Mycobacterium smeg- matis mc 2 155. It has 1,555,955bp and 64.7% G1C content. Rahel has a circularly per- muted genome with 270 genes, 53 of them of known function, 33 tRNAs, and 1 trans- fer-messenger RNA (tmRNA). Only ve genes are coded on the reverse strand. R ahel was isolated as part of the Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) at SUNY Old Westbury. Rahel was isolated in Hicksville, Nassau County, New York (lat 40.7679, long 273.5405). It is a C1 sub- cluster lytic phage that infects Mycobacterium smegmatis mc 2 155. Clusters are groups of phages with sequence similarity over 50% of their genome (1). It was isolated by enrichment and two cycles of purication and amplication in 7H9 top agar at 37°C (2). For transmission electron microscopy (TEM), phages were collected from a high-titer lysate by high-speed centrifugation, mounted on carbon-stabilized, Formvar-coated copper TEM grids stained with uranyl acetate, and imaged with a JEOL JEM-2100 TEM (2). Rahel is a Myoviridae of 178 nm total length with an isometric head and tail of 89 nm (Fig. 1). DNA was extracted from a high-titer lysate using a Promega DNA Wizard kit with a modi ed protocol (2). Genomic DNA libraries were generated using an Ultra II FS kit (New England Biolabs [NEB], Ipswich, MA) with dual-indexed barcoding. Pooled libraries were run on an Illumina MiSeq system at the University of Pittsburgh, which yielded at least 547,000 single-end 150-base reads. Read coverage depth was 471 without Sanger nishing reactions. The reads were assembled using Newbler and Consed (3). A full genome sequence was present, as the contigs on both ends overlapped and the genome circularized. Rahel was annotated using DNA Master ( http://cobamide2.bio.pitt.edu/computer.htm) and the gene prediction tools GLIMMER v3.0 (4), GeneMark v2.5 (5), and Starterator v1.1 to determine gene start sites. tRNA and transfer- messenger RNA (tmRNA) predictions were made using ARAGORN v1.2.38 (6) and tRNAscan- SE v3.0 (7). Functional assignments were made using BLAST v2.9 (8), HHpred (9), and Phamerator (10). Default settings were used for all programs. Rahel has 155,955 bp, 64.7% G1C content, 270 open reading frames (ORFs), 33 tRNAs, 1 tmRNA, 53 genes of known function, and 182 hypothetical proteins (67.6%); only ve ORFs are coded on the reverse strand. Rahel has a circularly permuted genome, as all of the reads assembled into a large contig and there was no large buildup of reads or cover- age variation at the ends (2). When linearized, the coding region of gene 270 is disrupted. Structural genes include genes 99 (the major capsid protein) and 100 (the capsid decora- tion protein), genes 126 to 144 (the tail structural proteins, including the tail assembly chaperone), and genes 129 and 130. Rahel has a tail sheath protein gene, 126, and two baseplate wedge protein genes, 138 and 139, but a tail tube protein was not identied. Genes involved in nucleotide metabolism and DNA replication are found on the right Citation Nieto-Fernandez FE, Noutsos C, Nissen J, Abdelsalam Y, Ackloo J, Banger N, Chan H, Chittineedi T, Duplessy I, Dyce M, Garrison D, Gonzalez J, John S, Kahlon I, Kumar T, Lewis A, Madhira K, Mullokandova R, Pirzadeh N, Raja I, Ram K, Ramdhari R, Reddy R, Saed BS, Smith P, Sproul S, Thomas J, YosseA, Morales J. 2020. Complete genome sequence of Rahel, a C1 cluster mycobacteriophage. Microbiol Resour Announc 9:e01071-20. https://doi.org/10.1128/MRA.01071-20. Editor John J. Dennehy, Queens College Copyright © 2020 Nieto-Fernandez et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license. Address correspondence to Fernando E. Nieto-Fernandez, [email protected]. Received 3 October 2020 Accepted 21 October 2020 Published 5 November 2020 November 2020 Volume 9 Issue 45 e01071-20 mra.asm.org 1 GENOME SEQUENCES on May 5, 2021 by guest http://mra.asm.org/ Downloaded from

Complete Genome Sequence of Rahel, a C1 Cluster ...end of the genome from gene 147 on to the end. In Rahel, the lysis system is encoded by genes 252 and 254, lysin A and B, respectively,

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Complete Genome Sequence of Rahel, a C1 Cluster ...end of the genome from gene 147 on to the end. In Rahel, the lysis system is encoded by genes 252 and 254, lysin A and B, respectively,

Complete Genome Sequence of Rahel, a C1 ClusterMycobacteriophage

Fernando E. Nieto-Fernandez,a Christos Noutsos,a Jillian Nissen,a Yara Abdelsalam,a Jessica Ackloo,a Navpreet Banger,a

Hason Chan,a Tarana Chittineedi,a Isabel Duplessy,a Mark Dyce,a Daesha Garrison,a Jaime Gonzalez,a Sandra John,a Imanjot Kahlon,a

Tania Kumar,a Adwoa Lewis,a Karthik Madhira,a Rivka Mullokandova,a Nellie Pirzadeh,a Iman Raja,a Kevin Ram,a Ravi Ramdhari,a

Rita Reddy,a Briana S. Saed,a Phalan Smith,a Steeve Sproul,a Jane Thomas,a Avia Yossefi,a Jorge Moralesb

aDepartment of Biological Sciences, SUNY Old Westbury, Old Westbury, New York, USAbThe City College of New York, New York, New York, USA

ABSTRACT Rahel is a lytic Myoviridae bacteriophage that infects Mycobacterium smeg-matis mc2155. It has 1,555,955bp and 64.7% G1C content. Rahel has a circularly per-muted genome with 270 genes, 53 of them of known function, 33 tRNAs, and 1 trans-fer-messenger RNA (tmRNA). Only five genes are coded on the reverse strand.

Rahel was isolated as part of the Science Education Alliance-Phage Hunters AdvancingGenomics and Evolutionary Science (SEA-PHAGES) at SUNY Old Westbury. Rahel was

isolated in Hicksville, Nassau County, New York (lat 40.7679, long 273.5405). It is a C1 sub-cluster lytic phage that infects Mycobacterium smegmatis mc2155. Clusters are groups ofphages with sequence similarity over 50% of their genome (1). It was isolated by enrichmentand two cycles of purification and amplification in 7H9 top agar at 37°C (2). For transmissionelectron microscopy (TEM), phages were collected from a high-titer lysate by high-speedcentrifugation, mounted on carbon-stabilized, Formvar-coated copper TEM grids stainedwith uranyl acetate, and imaged with a JEOL JEM-2100 TEM (2). Rahel is a Myoviridae of 178nm total length with an isometric head and tail of 89nm (Fig. 1). DNA was extracted from ahigh-titer lysate using a Promega DNA Wizard kit with a modified protocol (2). GenomicDNA libraries were generated using an Ultra II FS kit (New England Biolabs [NEB], Ipswich,MA) with dual-indexed barcoding. Pooled libraries were run on an Illumina MiSeq system atthe University of Pittsburgh, which yielded at least 547,000 single-end 150-base reads. Readcoverage depth was 471� without Sanger finishing reactions. The reads were assembledusing Newbler and Consed (3). A full genome sequence was present, as the contigs on bothends overlapped and the genome circularized. Rahel was annotated using DNA Master(http://cobamide2.bio.pitt.edu/computer.htm) and the gene prediction tools GLIMMER v3.0(4), GeneMark v2.5 (5), and Starterator v1.1 to determine gene start sites. tRNA and transfer-messenger RNA (tmRNA) predictions were made using ARAGORN v1.2.38 (6) and tRNAscan-SE v3.0 (7). Functional assignments were made using BLAST v2.9 (8), HHpred (9), andPhamerator (10). Default settings were used for all programs.

Rahel has 155,955 bp, 64.7% G1C content, 270 open reading frames (ORFs), 33 tRNAs,1 tmRNA, 53 genes of known function, and 182 hypothetical proteins (67.6%); only fiveORFs are coded on the reverse strand. Rahel has a circularly permuted genome, as all ofthe reads assembled into a large contig and there was no large buildup of reads or cover-age variation at the ends (2). When linearized, the coding region of gene 270 is disrupted.Structural genes include genes 99 (the major capsid protein) and 100 (the capsid decora-tion protein), genes 126 to 144 (the tail structural proteins, including the tail assemblychaperone), and genes 129 and 130. Rahel has a tail sheath protein gene, 126, and twobaseplate wedge protein genes, 138 and 139, but a tail tube protein was not identified.Genes involved in nucleotide metabolism and DNA replication are found on the right

Citation Nieto-Fernandez FE, Noutsos C,Nissen J, Abdelsalam Y, Ackloo J, Banger N,Chan H, Chittineedi T, Duplessy I, Dyce M,Garrison D, Gonzalez J, John S, Kahlon I, KumarT, Lewis A, Madhira K, Mullokandova R,Pirzadeh N, Raja I, Ram K, Ramdhari R, Reddy R,Saed BS, Smith P, Sproul S, Thomas J, Yossefi A,Morales J. 2020. Complete genome sequenceof Rahel, a C1 cluster mycobacteriophage.Microbiol Resour Announc 9:e01071-20.https://doi.org/10.1128/MRA.01071-20.

Editor John J. Dennehy, Queens College

Copyright © 2020 Nieto-Fernandez et al. Thisis an open-access article distributed under theterms of the Creative Commons Attribution 4.0International license.

Address correspondence to Fernando E.Nieto-Fernandez, [email protected].

Received 3 October 2020Accepted 21 October 2020Published 5 November 2020

November 2020 Volume 9 Issue 45 e01071-20 mra.asm.org 1

GENOME SEQUENCES

on May 5, 2021 by guest

http://mra.asm

.org/D

ownloaded from

Page 2: Complete Genome Sequence of Rahel, a C1 Cluster ...end of the genome from gene 147 on to the end. In Rahel, the lysis system is encoded by genes 252 and 254, lysin A and B, respectively,

end of the genome from gene 147 on to the end. In Rahel, the lysis system is encodedby genes 252 and 254, lysin A and B, respectively, followed by a terminase, gene 255. Wedid not find a holin gene as part the lysis system.

Data availability. The complete sequence for Rahel’s genome can be found inGenBank under accession number MK359348 and SRA accession number SRX9162484.

ACKNOWLEDGMENTSWe thank the HHMI SEA-PHAGES program for the opportunity to be part of this project.

We are grateful to the School of Arts and Sciences at SUNY Old Westbury and theDepartment of Biology for supporting this work. We also thank the University of PittsburghSEA-PHAGES team members involved in sequencing, assembly, and quality control (QC)annotation.

REFERENCES1. Hatful G. 2014. Molecular genetics of mycobacteriophages. Microbiol

Spectr 2:1–36. https://doi.org/10.1128/microbiolspec.MGM2-0032-2013.2. Poxleitner M, Pope W, Jacobs-Sera D, Sivanathan V, Hatfull G. 2018. Phage

discovery guide. Howard Hughes Medical Institute, Chevy Chase, MD.3. Russell DA. 2018. Sequencing, assembling, and finishing complete

bacteriophage genomes. Methods Mol Biol 1681:109–125. https://doi.org/10.1007/978-1-4939-7343-9_9.

4. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improvedmicrobial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. https://doi.org/10.1093/nar/27.23.4636.

5. Lukashin A, Borodovsky M. 1998. GeneMark.hmm: new solutions for genefinding. Nucleic Acids Res 26:1107–1115. https://doi.org/10.1093/nar/26.4.1107.

6. Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes

and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16.https://doi.org/10.1093/nar/gkh152.

7. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detectionof transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964.https://doi.org/10.1093/nar/25.5.955.

8. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic localalignment search tool. J Mol Biol 215:403–410. https://doi.org/10.1016/S0022-2836(05)80360-2.

9. Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server forprotein homology detection and structure prediction. Nucleic Acids Res33:W244–W248. https://doi.org/10.1093/nar/gki408.

10. Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011.Phamerator: a bioinformatic tool for comparative bacteriophage genomics.BMC Bioinformatics 12:395. https://doi.org/10.1186/1471-2105-12-395.

FIG 1 TEM of Rahel mounted on Formvar-coated copper grids stained with uranyl acetate and imagedwith a JEOL JEM-2100 microscope. Rahel’s virions are 178 nm in total length, with an isometric head andtail of 89nm (�50,000 magnification).

Nieto-Fernandez et al.

November 2020 Volume 9 Issue 45 e01071-20 mra.asm.org 2

on May 5, 2021 by guest

http://mra.asm

.org/D

ownloaded from