High Resolution GC-MS Application: Metabolomics

Vladimir Tolstikov, PhD

Eli Lilly and Company

Sample Harvest and Storage

Biological Metadata

Sample Extraction Extraction Metadata

Sample Preparation RI internal standards, Derivatization

Sample Analysis QC, randomization

Standard Operational Procedure

Raw Data

Chromatography Metadata Mass Spectrometry Metadata

Metabolite Peak Annotation

Data normalization, background subtraction, detection limit

Analytical Protocols

Processed Data Collection and Organization Statistical Analysis Pathway Analysis

Experiment Submission

Volatiles Alchohols Organic acids

Essential oils Amino acids Organic amines

Esters Catecholamines Nucleosides

Perfumes Fatty acids Nucleotides

Terpenes Phenolics Oligosaccharides

Carotenoids Prostanglandins Peptides

Flavanoids Steroids Co-factors

Perfumes Sugar phosphates Polar Lipids

LC/MS GC/MS

PEGASUS GC-HRT accurate mass TOF Gerstel ALEX/CIS MultiPurpose Autosampler

Triple TOF 5600 accurate mass Triple quad 5500

Lilly Metabolomics Platform

Lilly Metabolomics Platform Data Analysis and Visualization

• Statistical analysis: An array of commonly used statistical and machine learning methods :

• univariate -fold change analysis, t-tests, volcano plot, and one-way ANOVA, correlation analysis;

• multivariate - principal component analysis (PCA), partial least squares - discriminant analysis (PLS-DA) and PCA-DA;

• clustering - dendrogram, heatmap, K-means, and self organizing map (SOM));

• supervised classification - random forests and support vector machine (SVM).

• Functional enrichment analysis: The analysis is based on several libraries containing ~6300 groups of biologically meaningful metabolite sets collected primarily from human studies;

• Metabolic pathway analysis: Pathway analysis (including pathway enrichment analysis and pathway topology analysis) and visualization for Human metabolic pathways with a total collection of 1173 pathways;

• Pathway analysis : MetPA, Ingenuity, GeneGo

Human urine GC/MS profiling

Throughput Quality

12:30.00 16:40.00 20:50.00 25:00.00 29:10.00 33:20.00 37:30.00 41:40.00 45:50.00

Time (min:sec)AIC

12:30.00 16:40.00 20:50.00 25:00.00 29:10.00 33:20.00 37:30.00 41:40.00 45:50.00

Time (min:sec)AIC

Mouse CSF

Sample volume - 2uL

Methoxyamine, MSTFA 2% TMSCI

1 uL splitless, CIS C4 injector

Detector EI 70ev

>60% probability score

>3000 peaks deconvoluted

>1200 names assigned

~ 75 metabolites identified

Metabolomics study requirements for

GC/MS instruments

GC-HRT

1 Sensitivity √

2 Fast acquisition √

3 Robustness √

4 Reproducibility √

Unique features

1 Routine stable high resolution √

2 Routine stable high mass accuracy √

3 True peak deconvolution √

4 Elemental composition assignment √

High Resolution, High Mass Accuracy: YES or NO ID

Case study Pancreatic Cancer

• PDAC patients - 119 Group 1 • Healthy volunteers – 55 Group 2 • Benign cyst – 41 Group 2a • Chronic pancreatitis – 32 Group 3 • Other cancers – 19 Group 4 Unpaired samples. Blood plasma analysis.

GC/TOF/MS - 70 polar metabolites,

LC/MS/MS (MRM) – panel: Eicosanoids, LPA, SP1, SPA1, Bile acids, PC. 30 non-polar metabolites

Study performed in UC Davis Genome Center, Davis CA, USA

Cohort Study Design

PLS-DA Random Forest

Cross platform data integration: Metabolomics data obtained from current study: 95 metabolites Transcriptomics data was retrieved from Pancreatic Expression Database: 255 genes

Experimental Data

Prediction

Carbohydrate Metabolism, Energy production, Small Molecule Biochemistry

Experimental Data

Carbohydrate Metabolism, Energy production, Small Molecule Biochemistry

Prediction

Small molecule biomarkers

Current study

Univariate Classic ROC analysis for selected metabolite ratios

100 cross validation (CV) were performed and the results were averaged to generate the plot with threshold averaging.

Multivariate ROC analysis (PLS-DA)

The prediction model was composed of 15 features. 21 random samples from each group were allocated as hold-out data for validation.

Group 0 – PDAC patients; Group 1 - controls red circles - predicted scores for hold-out samples Numbers – samples classified to the wrong group

The average accuracy based on 100 cross validations is 0.907. The accuracy for hold out data prediction is 0.905(38/42).

Performance Measure: Area under ROC curve Permutation Times: 100

Multivariate ROC analysis (PLS-DA)

AUC, sensitivity, specificity, and accuracy were 0.965, 95.0%, 95.0%, and 90.0%, respectively, according to the training set data.

• Screening a panel of biomarkers might be effective by embracing the idea that pancreatic adenocarcinoma has vast genetic heterogeneity, meaning no single biomarker exists that is strongly correlated with its diagnosis across the population of people who develop the disease.

• Using a statistical model, it is possible to determine that many of so called weak biomarkers, having 95 percent specificity for the disease, on average, have only a 32 percent sensitivity.

• Increasing number of weak biomarkers it would be possible to achieve required 99 percent sensitivity.

• There is hope for developing a panel that would have greater than 99 percent accuracy.

American Association for Cancer Research, Press Release 2012

Acknowledgments

Prof. Shiro Urayama, MD, UC Davis, Department of Gastroenterology and

Hepatology, Davis, CA, USA Dr. Jean-Noel Billaud, PhD, INGENUITY SYSTEMS, Redwood City, CA, USA Dr. Wei Zou, PhD, Kindra Brooks, BS, UC Davis, Genome Center, Davis, CA, USA

High Resolution GC-MS Application: Metabolomics · Lilly Metabolomics Platform Data Analysis and...

Documents

METABOLOMICS RESEARCH

Lilly Global External R&D · Venture Capital Lilly Ventures, Lilly Asia Ventures External Innovation Strategy In-license Partnerships Lilly Research& Fellowship Awards Strategic Alliances

Metabolomics - Bioconductor

An Integrated Approach to Metabolomics Studies: …tools.thermofisher.com/.../AN-656-LC-MSn-Metabolomics-AN64832-EN.… · An Integrated Approach to Metabolomics Studies: Discovery

Behavioral Metabolomics

Statistical strategies for avoiding false discoveries …dbkgroup.org/Papers/broadhurst_kell_metabolomics06.pdfStatistical strategies for avoiding false discoveries in metabolomics

OWL metabolomics 2014... · OWL metabolomics

Rising toLead - Providence High School...Lilly ’76, Pam (Lilly) Kraft ’77, Janine (Lilly) Kelty ’78, Patrick Lilly ’80, Mark Lilly ’81 and Amy (Lilly) Franklin ’97. She

Metabolomics, Bruker’s Complete Solution – Featuring ... s Complete Solution – Featuring MetaboScape and TASQ Metabolomics Power Your Metabolomics Studies The metabolome is the

Metabolomics - LMU

Metabolomics PCB 5530 Tom Niehaus Fall 2014. Learning Outcomes - Learn the basics of metabolomics - Understand the limitations of metabolomics - Things

4th International Conference and Exhibition on Metabolomics & …ww1.prweb.com/.../22/12189722/Metabolomics-2015_Brochure.pdf · 2014-09-22 · httpmetabolomicsconerencecom Metabolomics-2015

Data Analysis in Metabolomics - Ecetoc •Overview of metabolomics data processing workflow •Differences between metabolomics and transcriptomics data •Approaches to improving

Lilly Bamlanivimab Antibody Playbook - Eli Lilly and Company...Lilly Bamlanivimab Antibody Playbook ELI LILLY AND COMPANY|DECEMBER 2020 For the Emergency Use Authorization of bamlanivimabfor

From Statistical to Biological Interactions via Omics ... · Omics data 8 Genomics Epigenomics Transcriptomics Proteomics Metabolomics Phenomics ... Integrative network-based analysis

Development of a gas chromatography/mass spectrometry based metabolomics protocol by means of statistical experimental design

Progress in metabolomics standardisation and its ... · Progress in metabolomics standardisation and its . ... Progress in metabolomics standardisation and its significance in future

Metabolomics- ijp

NIH Public Access Abstract Metabolomics Data Methods …xbw/research/nihms656480.pdf · Statistical Analysis and Modeling of Mass Spectrometry-Based Metabolomics Data Bowei Xi, Haiwei

Roteomics and Metabolomics Application of omics technologies in toxicology: Proteomics and Metabolomics