1
Pyramidobacter piscolens Fusobacterium mortiferum Bacteroides Parabacteroides johnsonii Oribacterium sinus Streptococcus sp. oral taxon 056 Capnocytophaga Cardiobacterium valvarum Haemophilus Firmicutes Oribacterium sp. oral taxon 108 Prevotella Proteobacteria Streptomyces sp. HGB0020 Anaerofustis stercorihominis Anaerotruncus colihominis Bacteroidetes Capnocytophaga sp. oral taxon 326 Clostridiales bacterium oral taxon 876 Hungatella hathewayi Lactobacillus Pseudoflavonifractor capillosus Actinobacteria <phylum> Actinomyces Actinomyces israelii Actinomyces sp. oral taxon 848 Burkholderia Delftia acidovorans Edwardsiella tarda Eubacterium ventriosum Fusobacterium necrophorum Klebsiella Neisseria elongata Paenibacillus Prevotella oris Providencia alcalifaciens Pseudomonas Rothia dentocariosa Succinatimonas hippei Actinomyces cardiffensis Actinomyces odontolyticus Alistipes putredinis Anaerococcus Atopobium vaginae Bacteroides coprophilus Bacteroides intestinalis Bacteroides massiliensis Bacteroides sp. 9_1_42FAA Bifidobacterium Bifidobacterium dentium Blautia obeum Capnocytophaga granulosa Catenibacterium mitsuokai Catonella morbi Clostridium Collinsella tanakaei Delftia Desulfovibrio Dorea formicigenerans Eikenella corrodens Enterococcus Enterococcus casseliflavus Enterococcus faecium Escherichia coli Fusobacterium periodonticum Kytococcus sedentarius Lactobacillus rhamnosus Listeria grayi Megasphaera Mycoplasma fermentans Neisseria sp. oral taxon 020 Ochrobactrum Oxalobacter formigenes Paenibacillus chitinolyticus Paenisporosarcina Parascardovia denticolens Peptostreptococcus stomatis Porphyromonas Prevotella maculosa Prevotella veroralis Pseudomonas protegens Ruminococcus sp. 5_1_39BFAA Sanguibacter keddieii Scardovia wiggsiae Selenomonas artemidis Shuttleworthia Staphylococcus haemolyticus Streptococcus Streptococcus sanguinis Streptomyces Treponema Turicella otitidis Turicibacter Tyzzerella nexilis Yersinia Yersinia pestis [Clostridium] hiranonis [Eubacterium] hallii [Ruminococcus] torques Functional Analysis of Microbiomes By Metaproteomics Within Galaxy Framework Pratik Jagtap 1 , James Johnson 1 , Kevin Viken 1 , Brian Sandri 1 , Bjoern Gruening 2 , Subina Mehta 1 , Thomas McGowan 1 , Alexa Pragman 1 , Maneesh Bhargava 1 , Chris Wendt 1 , Joel Rudney 1 , Tim Griffin 1 1 University of Minnesota, Minneapolis, MN, 2 University of Freiburg, Freiburg, Germany INTRODUCTION CHALLENGES Large metagenomic databases with data from multiple organisms. Peptide sharing / Protein inference across multiple organisms. Interpretation of biological function. Validation of the peptide identification. Disparate tools and multiple processing steps. Proteomics. doi: 10.1002/pmic.201500183. METAPROTEOMIC WORKFLOWS IN GALAXY-P SOLUTION: GALAXY PLATFORM Goecks J et al Genome Biol. 2010;11(8):R86. A web-based bioinformatics data analysis platform. Software accessibility and usability. Share-ability of tools, workflows and histories. Reproducibility and ability to test and compare results using multiple parameters. Ability to assimilate disparate software into integrated workflows. Software tools can be used in a sequential manner to generate analytical workflows that can be reused, shared and creatively modified. Protein Database Downloader downloads UniProt protein FASTA databases of various organisms. SUCROSE-INDUCED DYSBIOSIS IN DENTAL CARIES ( RUDNEY LAB ) Rudney et al., BMC Microbiome DOI: 10.1186/s40168-015-0136-z Paired design – dental caries microcosms were grown in presence/ absence of sucrose. Sucrose induces acid production in dental caries (dysbiosis) Twelve paired replicates were analyzed using LC- MS/MS analysis NS: Absence of sucrose WS: Presence of sucrose P: Plaque samples SEED Functional Analysis of metaproteomics data within MEGAN MEGAN6 Taxonomic Analysis Radial plot of taxa identified. Lung cancer patient tumor samples were compared to healthy adjacent tissue and control tissue samples. Complementary 16S rRNA microbiota sequencing was also performed on these samples. 232 microbial peptides were assigned to 1 0 7 InterPro2GO proteins Microbiome of fluid proximate to the site of lung injury was analyzed. The sample was acquired by using a bronchoscopy procedure. BALF from subjects with Acute Respiratory Disease (ARDS) patients and lung injury after Hematopoietic Stem Cell Transplant (HSCT) were analyzed. GO:0009058 biosynthetic process Other GO:0016853 isomerase activity GO:0044281 small molecule metabolic process GO:0006807 nitrogen compound metabolic process GO:0016740 transferase activity GO:0000166 nucleotide binding GO:0006790 sulfur compound metabolic process GO:0046872 metal ion binding Unclassified GO:0016787 hydrolase activity GO:0005975 carbohydrate metabolic process GO:0051186 cofactor metabolic process GO:0006810 transport GO:0009117 nucleotide metabolic process GO:0003676 nucleic acid binding GO:0005215 transporter activity GO:0006259 DNA metabolic process GO:0006520 cellular amino acid metabolic process GO:0016491 oxidoreductase activity GO:0055114 oxidation-reduction process GO:0006950 response to stress GO:0016070 RNA metabolic process GO:0005737 cytoplasm GO:0009056 catabolic process GO:0001071 nucleic acid binding transcription factor activity GO:0006091 generation of precursor metabolites and energy GO:0016020 membrane GO:0016310 phosphorylation GO:0016829 lyase activity GO:0051082 unfolded protein binding GO:0006412 translation GO:0016874 ligase activity GO:0048037 cofactor binding GO:0004872 receptor activity GO:0006629 lipid metabolic process GO:0016043 cellular component organization GO:0042592 homeostatic process GO:0000988 transcription factor activity, protein binding GO:0005840 ribosome GO:0007154 cell communication GO:0019538 protein metabolic process Word cloud of microbial genera present in BALF Mapping of the BALF microbial peptides to proteins and functional Gene Ontology terms provides insights into active microbial pathways. CONCLUSIONS Proteomics. doi: 10.1002/pmic.201500074 STEP INPUT TOOL OUTPUT A COMPONENTS OF METAPROTEOMICS WORKFLOW. B C Database generation URL Link of database / Taxon list / Metagenomic FASTQ Files Protein dB downloader / UniProt XML Downloader / Sixgill / Merge FASTA Merged FASTA file. Peaklist generation Thermo RAW Files. msconvert, MGF Formatter mzML and MGF files. Database Search MGF Files, Search database. ProteinPilot / SearchGUI / PeptideShaker .group file, peptide summary and PSPEP FDR report. PSM Report First-Step " Workflow with text manipulation tools. .group file, peptide summary. Second-Step MGF Files, Modified search database. Workflow with text manipulation tools. .group file, peptide summary and PSPEP FDR report. Identifying peptides from the microbial database. Peptide Summary " Peptide List. BLAST-P Search FASTA Format of the Peptide List BLAST-P and short BLAST-P workflow XML or Tabular Output for MEGAN6 Analysis Taxonomy Analysis Peptides List UniPept Genera identified and phylogenetic tree PSM Evaluation Peptide Summary, mzML files PSM Evaluator, ProtVis. Tabular or HTML Report 2 1 F D E G F G FUNCTIONAL MICROBIOME ANALYSIS 16S rRNA sucrose microcosms retained inter-individual taxonomic variation MGF Files were searched against custom databases to identify microbial peptides. Automated workflows including BLAST-P analysis were used to generate output for taxonomic and functional characterization. Taxonomy Analysis of metaproteomics data within MEGAN BRONCHO-ALVEOLAR LAVAGE FLUID METAPROTEOME ( BHARGAVA LAB ) LUNG TISSUE METAPROTEOME ( WENDT LAB ) Bronchoscopy 53 genera (notably Pyramidobacter, Orbibacterium, Fusobacterium) and 71 species (e.g. Pyramidobacter piscolens) were identified from 776 microbial peptides. IPR000652 Triosephosphate isomerase IPR012726 Thiazole biosynthesis ThiH IPR013126 Heat shock protein 70 family IPR004473 Restriction endonuclease, type I, HsdR IPR020449 Transcription regulator HTH, AraC- type IPR000600 ROK family IPR002133 S-adenosylmethionine synthetase IPR001270 ClpA/B family IPR001672 Phosphoglucose isomerase (PGI) IPR005632 Chaperone protein Skp IPR005996 Ribosomal protein L30, bacterial-type IPR011147 Bifunctional aspartokinase/homoserine dehydrogenase I IPR012094 tRNA(Ile)-lysidine synthase IPR001404 Heat shock protein Hsp90 family IPR001576 Phosphoglycerate kinase IPR002177 DNA-binding protein Dps IPR002347 Short-chain dehydrogenase/reductase SDR IPR004506 tRNA-specific 2-thiouridylase IPR004629 Glycosyl transferase WecB/TagA/CpsF IPR005259 Primosomal protein N' IPR005294 ATPase, F1 complex, alpha subunit IPR005564 Major capsid protein GpE IPR005662 GTP-binding protein Era IPR005719 Dihydroorotate dehydrogenase, class 2 IPR005722 ATPase, F1 complex, beta subunit IPR005746 Thioredoxin IPR010949 TonB-dependent haemoglobin/transferrin/lactoferrin receptor IPR016718 rRNA (guanine-N1-)-methyltransferase A, predicted IPR017821 Succinate CoA transferase IPR018320 DNA polymerase 1 IPR024706 Peroxiredoxin, AhpC-type IPR000015 Outer membrane usher protein IPR000206 Ribosomal protein L7/L12 IPR000394 RNA polymerase sigma factor 54 IPR000447 FAD-dependent glycerol-3-phosphate dehydrogenase IPR000577 Carbohydrate kinase, FGGY IPR000771 Ketose-bisphosphate aldolase, class-II IPR000941 Enolase IPR000944 Transcription regulator Rrf2-type IPR001272 Phosphoenolpyruvate carboxykinase, ATP-utilising IPR001308 Electron transfer flavoprotein alpha subunit/FixB IPR001474 GTP cyclohydrolase I IPR001645 Folylpolyglutamate synthetase IPR001851 ABC transporter, permease IPR002155 Thiolase IPR002201 Glycosyl transferase, family 9 IPR002504 NAD kinase IPR003004 GspF/PilC family IPR004387 Peptidase M50, putative membrane-associated zinc metallopeptidase IPR004396 Ribosome-binding ATPase YchF/Obg-like ATPase 1 IPR004429 Isopropylmalate dehydrogenase IPR004665 Transcription termination factor Rho IPR004730 Transaldolase type 1 IPR004763 Cation efflux system CzcA/CusA/SilA/NccA/HelA/CnrA IPR004797 Competence protein ComEC/Rec2 IPR005763 L-fucose isomerase IPR005841 Alpha-D-phosphohexomutase superfamily IPR005950 Molybdenum ABC transporter, periplasmic binding protein IPR005982 Thioredoxin reductase IPR006034 Asparaginase/glutaminase IPR006043 Xanthine/uracil/vitamin C permease IPR006101 Glycoside hydrolase, family 2 IPR006129 Adhesin B IPR006264 3-phosphoshikimate 1-carboxyvinyltransferase IPR006275 Carbamoyl-phosphate synthase, large subunit IPR006276 Cobalamin-independent methionine synthase IPR006308 DNA polymerase III, alpha subunit, Gram-positive type IPR006439 HAD hydrolase, subfamily IA IPR009948 Syd IPR010131 RND efflux system, outer membrane lipoprotein, NodT IPR011179 Isopentenyl-diphosphate delta-isomerase, FMN-dependent IPR011701 Major facilitator superfamilyIPR011825 23S rRNA (uracil(747)-C(5))-methyltransferase RlmC IPR011877 Ribokinase IPR013440 Tryptophanase IPR014046 Diadenylate cyclase IPR014101 Carotene isomerase IPR014780 tRNA pseudouridine synthase II, TruB IPR015807 Histidine-tRNA ligase IPR015963 Uridylate kinase, bacteria IPR020591 Chromosomal replication control, initiator DnaA-like IPR022801 Ribosomal protein S4/S9 IPR023468 Riboflavin kinase IPR024904 Ornithine carbamoyltransferase IPR025705 Beta-hexosaminidase IPR026569 Ribosomal protein L28/L24 IPR026892 Glycoside hydrolase family 3 IPR028362 Alginate O-acetyltransferase AlgI IPR028829 Exo-beta-D-glucosaminidase IPR029703 DNA polymerase epsilon catalytic subunit IPR030854 Ribonuclease J, bacteria IPR030963 3-dehydroquinate synthase family IPR031322 Shikimate kinase/gluconokinase IPR032816 SNARE associated Golgi protein IPR033173 Translational activator Gcn1 TAXONOMY ANALYSIS USING METAPROTEOME DATA FUNCTIONAL ANALYSIS USING METAPROTEOME DATA 71 genera (For e.g, Bacteroides, Lactobacillus) and 84 species were identified from 729 microbial peptides TAXONOMY ANALYSIS (METAPROTEOMICS) 336 microbial peptides could be assigned to 132 InterPro2GO proteins, notably the ATPase, F1 complex (19% of reads) FUNCTIONAL ANALYSIS (METAPROTEOMICS) Metaproteomic methods hold promise in unravelling the mechanistic details of microbial interactions with host/environment by offering insights into functional dynamics of the microbiome. From presentation at the 64th ASMS Annual Conference 2016 Methods SATURDAY - 604 Contact Email: [email protected] Phone: 612-816-4232 Metaproteomics characterizes proteins expressed by complex microbial communities, which in turn can enable new insights into the functional dynamics of microbiomes. Recent studies suggest that functional characterization via metaproteomic analysis might more accurately measure the real-time response of microbial communities to perturbations in their environment. Using the Galaxy-P framework, we demonstrate the use of workflows as a metaproteomic informatics solution. MEGAN5 was used to generate functional characterization of the metaproteomes using SEED pathway analysis. Taxonomically-diverse sucrose-pulsed microcosms clustered more closely by function. Function-based changes may be better indicators of dysbiosis. Statistical analysis identified proteins that were expressed at a higher abundance in presence of sugar. Selected reaction monitoring (SRM) was used to quantify conserved peptides from three enzymes whose abundance increased in sucrose-induced dysbiosis: NADP-specific glutamate dehydrogenase (EC 1.4.1.4) ; Ketol-acid reductoisomerase (EC 1.1.1.86) ; L-lactate dehydrogenase (EC 1.1.1.27). Our results demonstrate advantages of Galaxy-P for metaproteomic analysis, such as easy access to a wide variety of tools for a wide community of researchers. Metaproteomic analysis can identify functional components of diverse microbiomes such as dental plaque, lung tissue & BALF. A Metaproteomics Gateway - z.umn.edu/metaproteomicsgateway - on JetStream instance is available for metaproteomics researchers and software developers. We are currently in the process of adding new metaproteomic tools and workflows to the Galaxy-P platform. ACKNOWLEDGEMENTS: We acknowledge NSF grant 1458524, and NIH grant 1U24CA199347 for funding support to the Galaxy-P team at the University of Minnesota. Alexa Pragman is supported by VA ORD 1 IK2 CX001095-01A2 grant. We also thank the Center for Mass Spectrometry and Proteomics at the University of Minnesota.

Functional Analysis of Microbiomes By Metaproteomics ...galaxyp.org/wp-content/uploads/2017/07/Metaproteomics_ASM_201… · 336 microbial peptides could be assigned to 132 InterPro2GO

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Functional Analysis of Microbiomes By Metaproteomics ...galaxyp.org/wp-content/uploads/2017/07/Metaproteomics_ASM_201… · 336 microbial peptides could be assigned to 132 InterPro2GO

Pyramidobacter piscolensFusobacterium mortiferum

Bacteroides

Parabacteroides johnsonii

Oribacterium sinus

Streptococcus sp. oral taxon 056Capnocytophaga

Cardiobacterium valvarum

Haemophilus

Firmicutes

Oribacterium sp. oral taxon 108

PrevotellaProteobacteria

Streptomyces sp. HGB0020

Anaerofustis stercorihominis

Anaerotruncus colihominisBacteroidetesCapnocytophaga sp. oral taxon 326

Clostridiales bacterium oral taxon 876

Hungatella hathewayi

LactobacillusPseudoflavonifractor capillosus

Actinobacteria <phylum>

Actinomyces

Actinomyces israeliiActinomyces sp. oral taxon 848

Burkholderia

Delftia acidovorans

Edwardsiella tarda

Eubacterium ventriosumFusobacterium necrophorum

Klebsiella

Neisseria elongataPaenibacillusPrevotella oris Providencia alcalifaciens

Pseudomonas Rothia dentocariosa

Succinatimonas hippei

Actinomyces cardiffensis

Actinomyces odontolyticus

Alistipes putredinis

Anaerococcus

Atopobium vaginae

Bacteroides coprophilus

Bacteroides intestinalis

Bacteroides massiliensis

Bacteroides sp. 9_1_42FAA

BifidobacteriumBifidobacterium dentium

Blautia obeumCapnocytophaga granulosa

Catenibacterium mitsuokai

Catonella morbi Clostridium

Collinsella tanakaei

Delftia

Desulfovibrio Dorea formicigenerans Eikenella corrodensEnterococcus

Enterococcus casseliflavus

Enterococcus faecium

Escherichia coli

Fusobacterium periodonticum Kytococcus sedentarius Lactobacillus rhamnosus

Listeria grayi MegasphaeraMycoplasma fermentans

Neisseria sp. oral taxon 020

Ochrobactrum

Oxalobacter formigenes Paenibacillus chitinolyticus

Paenisporosarcina

Parascardovia denticolens Peptostreptococcus stomatis PorphyromonasPrevotella maculosa

Prevotella veroralis

Pseudomonas protegens Ruminococcus sp. 5_1_39BFAA

Sanguibacter keddieii

Scardovia wiggsiae

Selenomonas artemidis

Shuttleworthia Staphylococcus haemolyticus

Streptococcus

Streptococcus sanguinis

StreptomycesTreponema

Turicella otitidis

TuricibacterTyzzerella nexilis

Yersinia

Yersinia pestis

[Clostridium] hiranonis

[Eubacterium] hallii [Ruminococcus] torques

Functional Analysis of Microbiomes By Metaproteomics Within Galaxy Framework Pratik Jagtap1, James Johnson1, Kevin Viken1, Brian Sandri1, Bjoern Gruening2, Subina Mehta1, Thomas McGowan1, Alexa Pragman1, Maneesh Bhargava1, Chris Wendt1, Joel Rudney1, Tim Griffin1

1University of Minnesota, Minneapolis, MN, 2University of Freiburg, Freiburg, Germany

INTRODUCTION

CHALLENGES •  Large metagenomic databases with

data from multiple organisms. •  Peptide sharing / Protein inference

across multiple organisms. •  Interpretat ion of biological

function. •  Va l i d a t i o n o f t h e p e p t i d e

identification. •  Disparate tools and multiple

processing steps. Proteomics. doi: 10.1002/pmic.201500183.

METAPROTEOMIC WORKFLOWS IN GALAXY-P

SOLUTION: GALAXY PLATFORM

Goecks J et al Genome Biol. 2010;11(8):R86.

•  A web-based bioinformatics data analysis platform.

•  Software accessibility and usability. •  Share-ability of tools, workflows and

histories. •  Reproducibility and ability to test and

c o m p a re re s u l t s u s i n g m u l t i p l e parameters.

•  Ability to assimilate disparate software into integrated workflows.

Software tools can be used in a sequential manner to generate analytical workflows that can be reused, shared and creatively modified.

Protein Database Downloader downloads UniProt protein FASTA databases of various organisms.

SUCROSE-INDUCED DYSBIOSIS

IN DENTAL CARIES (RUDNEY LAB) Rudney et al., BMC Microbiome DOI: 10.1186/s40168-015-0136-z

•  Paired design – dental caries microcosms were g ro w n i n p re s e n c e /absence of sucrose.

•  Sucrose induces acid production in dental caries (dysbiosis)

•  Twelve paired replicates were analyzed using LC-MS/MS analysis

NS: Absence of sucrose WS: Presence of sucrose P: Plaque samples

SEED Functional Analysis of metaproteomics data within MEGAN

MEGAN6 Taxonomic Analysis Radial plot of taxa identified.

•  Lung cancer patient tumor samples were compared to healthy adjacent tissue and control tissue samples.

•  Complementary 16S rRNA microbiota sequencing was also performed on these samples.

232 microbial peptides were assigned to 1 0 7 InterPro2GO proteins

• M i c r o b i o m e o f f l u i d proximate to the site of lung injury was analyzed. The sample was acquired by using a bronchoscopy procedure.

• BALF from subjects with

Acute Respiratory Disease (ARDS) patients and lung injury after Hematopoietic Stem Cell Transplant (HSCT) were analyzed.

GO:0009058 biosynthetic processOther

GO:0016853 isomerase activityGO:0044281 small molecule metabolic process

GO:0006807 nitrogen compound metabolic process

GO:0016740 transferase activity

GO:0000166 nucleotide binding

GO:0006790 sulfur compound metabolic process

GO:0046872 metal ion binding

Unclassified

GO:0016787 hydrolase activity GO:0005975 carbohydrate metabolic process

GO:0051186 cofactor metabolic process

GO:0006810 transportGO:0009117 nucleotide metabolic process

GO:0003676 nucleic acid bindingGO:0005215 transporter activity

GO:0006259 DNA metabolic process

GO:0006520 cellular amino acid metabolic process

GO:0016491 oxidoreductase activityGO:0055114 oxidation-reduction process

GO:0006950 response to stress GO:0016070 RNA metabolic process

GO:0005737 cytoplasm

GO:0009056 catabolic process

GO:0001071 nucleic acid binding transcription factor activity

GO:0006091 generation of precursor metabolites and energy GO:0016020 membrane

GO:0016310 phosphorylation

GO:0016829 lyase activity

GO:0051082 unfolded protein binding

GO:0006412 translation

GO:0016874 ligase activityGO:0048037 cofactor bindingGO:0004872 receptor activityGO:0006629 lipid metabolic processGO:0016043 cellular component organizationGO:0042592 homeostatic process

GO:0000988 transcription factor activity, protein binding

GO:0005840 ribosomeGO:0007154 cell communicationGO:0019538 protein metabolic process

Word cloud of microbial genera present in BALF

Mapping of the BALF microbial peptides to

proteins and functional Gene Ontology terms provides insights into

active microbial pathways.

CONCLUSIONS

Proteomics. doi: 10.1002/pmic.201500074

STEP INPUT TOOL OUTPUT

A

COMPONENTS OF METAPROTEOMICS WORKFLOW.

BC

Database generation URL Link of database / Taxon list / Metagenomic FASTQ Files

Protein dB downloader / UniProt XML Downloader / Sixgill / Merge FASTA

Merged FASTA file.

Peaklist generation Thermo RAW Files. msconvert, MGF Formatter

mzML and MGF files.

Database Search

MGF Files, Search database.

ProteinPilot / SearchGUI / PeptideShaker

.group file, peptide summary and PSPEP FDR report. PSM Report

First-Step " Workflow with text manipulation tools.

.group file, peptide summary.

Second-Step MGF Files, Modified search database.

Workflow with text manipulation tools.

.group file, peptide summary and PSPEP FDR report.

Identifying peptides from the microbial database.

Peptide Summary " Peptide List.

BLAST-P Search FASTA Format of the Peptide List BLAST-P and short BLAST-P workflow

XML or Tabular Output for MEGAN6 Analysis

Taxonomy Analysis

Peptides List

UniPept

Genera identified and phylogenetic tree

PSM Evaluation

Peptide Summary, mzML files

PSM Evaluator, ProtVis.

Tabular or HTML Report

21

F

D

E

G

F

G

FUNCTIONAL MICROBIOME ANALYSIS

•  16S rRNA sucrose microcosms retained inter-individual taxonomic variation

•  MGF Files were searched against custom databases to identify microbial peptides.

•  Automated workflows including BLAST-P analysis were used to generate output for taxonomic and functional characterization.

Taxonomy Analysis of metaproteomics data within MEGAN

BRONCHO-ALVEOLAR LAVAGE FLUID

METAPROTEOME (BHARGAVA LAB)

LUNG TISSUE

METAPROTEOME (WENDT LAB)

Bronchoscopy

53 genera (notably Pyramidobacter, Orbibacterium, Fusobacterium) and 71 species (e.g. Pyramidobacter piscolens) were identified from 776 microbial peptides.

IPR000652 Triosephosphate isomeraseIPR012726 Thiazole biosynthesis ThiH

IPR013126 Heat shock protein 70 family

IPR004473 Restriction endonuclease, type I, HsdRIPR020449 Transcription regulator HTH, AraC- type

IPR000600 ROK family

IPR002133 S-adenosylmethionine synthetase

IPR001270 ClpA/B family

IPR001672 Phosphoglucose isomerase (PGI)IPR005632 Chaperone protein SkpIPR005996 Ribosomal protein L30, bacterial-type IPR011147 Bifunctional aspartokinase/homoserine dehydrogenase I

IPR012094 tRNA(Ile)-lysidine synthaseIPR001404 Heat shock protein Hsp90 family

IPR001576 Phosphoglycerate kinase

IPR002177 DNA-binding protein Dps

IPR002347 Short-chain dehydrogenase/reductase SDRIPR004506 tRNA-specific 2-thiouridylase

IPR004629 Glycosyl transferase WecB/TagA/CpsF IPR005259 Primosomal protein N'

IPR005294 ATPase, F1 complex, alpha subunitIPR005564 Major capsid protein GpE

IPR005662 GTP-binding protein Era

IPR005719 Dihydroorotate dehydrogenase, class 2

IPR005722 ATPase, F1 complex, beta subunit

IPR005746 Thioredoxin

IPR010949 TonB-dependent haemoglobin/transferrin/lactoferrin receptor

IPR016718 rRNA (guanine-N1-)-methyltransferase A, predicted

IPR017821 Succinate CoA transferase

IPR018320 DNA polymerase 1

IPR024706 Peroxiredoxin, AhpC-type

IPR000015 Outer membrane usher proteinIPR000206 Ribosomal protein L7/L12

IPR000394 RNA polymerase sigma factor 54IPR000447 FAD-dependent glycerol-3-phosphate dehydrogenase

IPR000577 Carbohydrate kinase, FGGYIPR000771 Ketose-bisphosphate aldolase, class-II

IPR000941 Enolase

IPR000944 Transcription regulator Rrf2-typeIPR001272 Phosphoenolpyruvate carboxykinase, ATP-utilising

IPR001308 Electron transfer flavoprotein alpha subunit/FixB IPR001474 GTP cyclohydrolase IIPR001645 Folylpolyglutamate synthetaseIPR001851 ABC transporter, permease

IPR002155 Thiolase

IPR002201 Glycosyl transferase, family 9

IPR002504 NAD kinase

IPR003004 GspF/PilC familyIPR004387 Peptidase M50, putative membrane-associated zinc metallopeptidaseIPR004396 Ribosome-binding ATPase YchF/Obg-like ATPase 1

IPR004429 Isopropylmalate dehydrogenase

IPR004665 Transcription termination factor Rho

IPR004730 Transaldolase type 1

IPR004763 Cation efflux system CzcA/CusA/SilA/NccA/HelA/CnrA IPR004797 Competence protein ComEC/Rec2

IPR005763 L-fucose isomerase

IPR005841 Alpha-D-phosphohexomutase superfamily

IPR005950 Molybdenum ABC transporter, periplasmic binding protein

IPR005982 Thioredoxin reductase

IPR006034 Asparaginase/glutaminaseIPR006043 Xanthine/uracil/vitamin C permease

IPR006101 Glycoside hydrolase, family 2

IPR006129 Adhesin B

IPR006264 3-phosphoshikimate 1-carboxyvinyltransferaseIPR006275 Carbamoyl-phosphate synthase, large subunitIPR006276 Cobalamin-independent methionine synthase IPR006308 DNA polymerase III, alpha subunit, Gram-positive type

IPR006439 HAD hydrolase, subfamily IAIPR009948 Syd

IPR010131 RND efflux system, outer membrane lipoprotein, NodT IPR011179 Isopentenyl-diphosphate delta-isomerase, FMN-dependent IPR011701 Major facilitator superfamilyIPR011825 23S rRNA (uracil(747)-C(5))-methyltransferase RlmC

IPR011877 RibokinaseIPR013440 Tryptophanase

IPR014046 Diadenylate cyclaseIPR014101 Carotene isomerase IPR014780 tRNA pseudouridine synthase II, TruB

IPR015807 Histidine-tRNA ligase

IPR015963 Uridylate kinase, bacteria

IPR020591 Chromosomal replication control, initiator DnaA-like IPR022801 Ribosomal protein S4/S9

IPR023468 Riboflavin kinase

IPR024904 Ornithine carbamoyltransferase

IPR025705 Beta-hexosaminidase

IPR026569 Ribosomal protein L28/L24

IPR026892 Glycoside hydrolase family 3

IPR028362 Alginate O-acetyltransferase AlgI IPR028829 Exo-beta-D-glucosaminidase IPR029703 DNA polymerase epsilon catalytic subunit

IPR030854 Ribonuclease J, bacteria

IPR030963 3-dehydroquinate synthase family

IPR031322 Shikimate kinase/gluconokinase

IPR032816 SNARE associated Golgi proteinIPR033173 Translational activator Gcn1

TAXONOMY ANALYSIS USING METAPROTEOME DATA

FUNCTIONAL ANALYSIS USING METAPROTEOME DATA

71 genera (For e.g, Bacteroides, Lactobacillus) and 84 species were identified

from 729 microbial peptides

TAXONOMY ANALYSIS (METAPROTEOMICS)

336 microbial peptides could be assigned to 132 InterPro2GO proteins, notably the ATPase, F1 complex (19% of reads)

FUNCTIONAL ANALYSIS (METAPROTEOMICS)

Metaproteomic methods hold

promise in unravelling the

mechanistic details of microbial

interactions with host/environment

by offering insights into functional dynamics of the

microbiome. From presentation at the 64th ASMS Annual Conference 2016

Methods

SATURDAY - 604Contact Email: [email protected]

Phone: 612-816-4232

•  Metaproteomics characterizes proteins expressed by complex microbial communities, which in turn can enable new insights into the functional dynamics of microbiomes.

•  Recent studies suggest that functional characterization via metaproteomic analysis might more accurately measure the real-time response of microbial communities to perturbations in their environment.

•  Using the Galaxy-P framework, we demonstrate the use of workflows as a metaproteomic informatics solution.

•  MEGAN5 was used to generate functional characterization of the metaproteomes using SEED pathway analysis.

•  Taxonomically-diverse sucrose-pulsed microcosms clustered more closely by function. Function-based changes may be better indicators of dysbiosis.

•  Statistical analysis identified proteins that were expressed at a higher abundance in presence of sugar.

•  Selected reaction monitoring (SRM) was used to quantify conserved peptides from three enzymes whose abundance increased in sucrose-induced dysbiosis: NADP-specific glutamate dehydrogenase (EC 1.4.1.4) ; Ketol-acid reductoisomerase (EC 1.1.1.86) ; L-lactate dehydrogenase (EC 1.1.1.27).

•  Our results demonstrate advantages of Galaxy-P for metaproteomic analysis, such as easy access to a wide variety of tools for a wide community of researchers.

•  Metaproteomic analysis can identify functional components of diverse microbiomes such as dental plaque, lung tissue & BALF.

•  A Metaproteomics Gateway -z.umn.edu/metaproteomicsgateway - on JetStream instance is available for metaproteomics researchers and software developers.

•  We are currently in the process of adding new metaproteomic tools and workflows to the Galaxy-P platform.

ACKNOWLEDGEMENTS:

We acknowledge NSF grant 1458524, and NIH grant 1U24CA199347 for funding support to the Galaxy-P

team at the University of Minnesota. Alexa Pragman is supported by VA ORD 1 IK2 CX001095-01A2 grant. We

also thank the Center for Mass Spectrometry and Proteomics at the University of Minnesota.