6
Please cite this article in press as: H.G. Gika, et al., J. Chromatogr. B (2014), http://dx.doi.org/10.1016/j.jchromb.2014.01.054 ARTICLE IN PRESS G Model CHROMB-18766; No. of Pages 6 Journal of Chromatography B, xxx (2014) xxx–xxx Contents lists available at ScienceDirect Journal of Chromatography B jou rn al hom ep age: www.elsevier.com/locate/chromb LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives Helen G. Gika a,, Ian D. Wilson b , Georgios A. Theodoridis c a Department of Chemical Engineering, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece b Department of Surgery and Cancer, Faculty of Medicine, Imperial College, London, UK c Department of Chemistry, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece a r t i c l e i n f o Article history: Received 30 November 2013 Accepted 25 January 2014 Available online xxx Keywords: Metabolomics Metabonomics Standardisation Quality control Mass spectrometry a b s t r a c t The present review aims to critically discuss some of the major problems and limitations of LC–MS based metabolomics as experienced from an analytical chemistry standpoint. Metabolomics offers dis- tinct advantages to a variety of life sciences. Continuous development of the field has been realised due to intensive efforts from a great many scientists from widely divergent backgrounds and research inter- ests as demonstrated by the contents of this special issue. The aim of this commentary is to describe current hindrances to field’s progress, (some unique to metabolomics, some common with other omics fields or with conventional targeted bioanalysis) to propose some potential solutions to overcome these constraints and to provide a future perspective for likely developments in the field. © 2014 Elsevier B.V. All rights reserved. 1. Introduction Metabonomics has been defined as the “the quantitative mea- surement of the dynamic multiparametric response of a living system to pathophysiological stimuli or genetic modification” [1]. Metabolomics or metabolic profiling deal with the study of the metabolic content (small molecule complement) of the analysed sample, cell or organ. These terms are now practically used inter- changeably. No matter the term used the field has evolved greatly over the last years [1–3] as seen by a significant increase in size of the field, the increasing number of active research groups and the continuing high degree of investment in capital equipment and facilities. As a result the numbers of publications and cita- tions although still considerably smaller than those of the other two Abbreviations: APCI, atmospheric pressure chemical ionisation; CE, capil- lary electrophoresis; DoE, design of experiment; ESI, electrospray ionisation; FT-ICR-MS, Fourier transformation ion cyclotron resonance mass spectrometry; GC–MS, gas chromatography–mass spectrometry; HILIC, hydrophilic interac- tion chromatography; HRMS, high resolution mass spectrometry; LC–MS, liquid chromatography–mass spectrometry; MetID, metabolite identification; PCR, poly- merase chain reaction; QC, quality control; QTOF-MS, quadrupole time of flight mass spectrometry; SFC, supercritical fluid chromatography. This paper is part of the special issue “Metabolomics II” by G. Theodoridis and D. Tsikas. Corresponding author. Tel.: +30 2310996224. E-mail address: [email protected] (H.G. Gika). major omics fields (genomics and proteomics) exhibit significantly faster growth rates as shown in Fig. 1. In the early days of holistic metabolic profiling, proton nuclear magnetic resonance (NMR) spectroscopy was the dominant analyt- ical tool, and enabled major advances in the art of global metabolite profiling, but more recently chromatographic separations (most especially gas and, in particular, liquid chromatography) coupled to mass spectrometry (MS) (GC–MS and LC–MS), have become the major instrumental approaches [4]. Other separation tech- niques such as capillary electrophoresis (CE) and supercritical fluid chromatography (SFC) have also been applied, but to date have not achieved to become widely adopted [4]. Overall in the broad bioanalytical field of global metabolic profiling, LC–MS currently represents the major instrumental technology; it also shows strong penetration in other omics fields such as proteomics where it con- tinues to displace 2D gel electrophoresis. In metabolomics the prominence of LC–MS can be attributed mostly to the following reasons: 1) The large number of available instruments coupled with a range of suitable vendor-specific and open source data processing soft- ware capabilities and a pool of trained operators. 2) The wide metabolite coverage provided by LC–MS, often with high sensitivity and specificity, which brings the field closer to the attainment of true “holistic” metabolic profiling. 3) The versatility of the technology: thus a single LC–MS instru- ment combination can be used for a variety of different http://dx.doi.org/10.1016/j.jchromb.2014.01.054 1570-0232/© 2014 Elsevier B.V. All rights reserved.

LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives

Embed Size (px)

Citation preview

Page 1: LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives

C

La

Ha

b

c

a

ARAA

KMMSQM

1

ssMmscootat

lFGtcms

D

h1

ARTICLE IN PRESSG ModelHROMB-18766; No. of Pages 6

Journal of Chromatography B, xxx (2014) xxx–xxx

Contents lists available at ScienceDirect

Journal of Chromatography B

jou rn al hom ep age: www.elsev ier .com/ locate /chromb

C–MS-based holistic metabolic profiling. Problems, limitations,dvantages, and future perspectives�

elen G. Gikaa,∗, Ian D. Wilsonb, Georgios A. Theodoridisc

Department of Chemical Engineering, Aristotle University of Thessaloniki, 54124 Thessaloniki, GreeceDepartment of Surgery and Cancer, Faculty of Medicine, Imperial College, London, UKDepartment of Chemistry, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece

r t i c l e i n f o

rticle history:eceived 30 November 2013ccepted 25 January 2014vailable online xxx

a b s t r a c t

The present review aims to critically discuss some of the major problems and limitations of LC–MSbased metabolomics as experienced from an analytical chemistry standpoint. Metabolomics offers dis-tinct advantages to a variety of life sciences. Continuous development of the field has been realised dueto intensive efforts from a great many scientists from widely divergent backgrounds and research inter-

eywords:etabolomicsetabonomics

tandardisationuality control

ests as demonstrated by the contents of this special issue. The aim of this commentary is to describecurrent hindrances to field’s progress, (some unique to metabolomics, some common with other omicsfields or with conventional targeted bioanalysis) to propose some potential solutions to overcome theseconstraints and to provide a future perspective for likely developments in the field.

© 2014 Elsevier B.V. All rights reserved.

ass spectrometry

. Introduction

Metabonomics has been defined as the “the quantitative mea-urement of the dynamic multiparametric response of a livingystem to pathophysiological stimuli or genetic modification” [1].etabolomics or metabolic profiling deal with the study of theetabolic content (small molecule complement) of the analysed

ample, cell or organ. These terms are now practically used inter-hangeably. No matter the term used the field has evolved greatlyver the last years [1–3] as seen by a significant increase in sizef the field, the increasing number of active research groups andhe continuing high degree of investment in capital equipment

Please cite this article in press as: H.G. Gika, et al., J. Chromatogr. B (20

nd facilities. As a result the numbers of publications and cita-ions although still considerably smaller than those of the other two

Abbreviations: APCI, atmospheric pressure chemical ionisation; CE, capil-ary electrophoresis; DoE, design of experiment; ESI, electrospray ionisation;T-ICR-MS, Fourier transformation ion cyclotron resonance mass spectrometry;C–MS, gas chromatography–mass spectrometry; HILIC, hydrophilic interac-

ion chromatography; HRMS, high resolution mass spectrometry; LC–MS, liquidhromatography–mass spectrometry; MetID, metabolite identification; PCR, poly-erase chain reaction; QC, quality control; QTOF-MS, quadrupole time of flight mass

pectrometry; SFC, supercritical fluid chromatography.� This paper is part of the special issue “Metabolomics II” by G. Theodoridis and. Tsikas.∗ Corresponding author. Tel.: +30 2310996224.

E-mail address: [email protected] (H.G. Gika).

ttp://dx.doi.org/10.1016/j.jchromb.2014.01.054570-0232/© 2014 Elsevier B.V. All rights reserved.

major omics fields (genomics and proteomics) exhibit significantlyfaster growth rates as shown in Fig. 1.

In the early days of holistic metabolic profiling, proton nuclearmagnetic resonance (NMR) spectroscopy was the dominant analyt-ical tool, and enabled major advances in the art of global metaboliteprofiling, but more recently chromatographic separations (mostespecially gas and, in particular, liquid chromatography) coupledto mass spectrometry (MS) (GC–MS and LC–MS), have becomethe major instrumental approaches [4]. Other separation tech-niques such as capillary electrophoresis (CE) and supercritical fluidchromatography (SFC) have also been applied, but to date havenot achieved to become widely adopted [4]. Overall in the broadbioanalytical field of global metabolic profiling, LC–MS currentlyrepresents the major instrumental technology; it also shows strongpenetration in other omics fields such as proteomics where it con-tinues to displace 2D gel electrophoresis. In metabolomics theprominence of LC–MS can be attributed mostly to the followingreasons:

1) The large number of available instruments coupled with a rangeof suitable vendor-specific and open source data processing soft-ware capabilities and a pool of trained operators.

2) The wide metabolite coverage provided by LC–MS, often with

14), http://dx.doi.org/10.1016/j.jchromb.2014.01.054

high sensitivity and specificity, which brings the field closer tothe attainment of true “holistic” metabolic profiling.

3) The versatility of the technology: thus a single LC–MS instru-ment combination can be used for a variety of different

Page 2: LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives

ARTICLE ING ModelCHROMB-18766; No. of Pages 6

2 H.G. Gika et al. / J. Chromatog

number of publica�ons per year

0

1000

2000

3000

4000

5000

6000

7000

8000

20122010200820062004

Genomics Proteomics Transcriptomics Metabolomics

Fig. 1. Plot of the number of publications found in Scopus per year using thesma

2p

a

1

2

3

4

5

3

lkwasa

earch terms in abstract, title, keywords: genomics, proteomics, metabonomics oretabolomics, transcriptomics. Search was conducted in the end of August 2013,

nd results for the year 2013 were extrapolated to the full length of the year.

functions (e.g., drug bioanalysis or environmental analysis)and then, with minimal or without modifications, can readilybe switched to endogenous metabolite profiling work. Inessence any contemporary MS instrument can be used formetabolomic/metabonomic profiling (though obviously withinstrument-dependant outcomes).

. Advantages offered by undertaking metabolomicrofiling

The major advantages and potential benefits offered by adopting holistic analytical approach to metabolic profiling include:

) the prospect of identifying novel markers: metabolites wherechanges in concentration/flux in response to the biological chal-lenge were unforeseen and/or unexpected with regard to theirbiochemical function,

) as a result of the above a gain in biochemical insight and newdescriptor(s) of the biochemical phenomena,

) integration of a “systems” approach as metabolite concen-tration information complements genomic/transcriptomic andproteomic data,

) the description of the real-time metabolic status of the stud-ied system. Metabolomics describes the metabolic status ofthe sample at the time of sample collection, providing asummary of the cells/organisms response to (among others)gene/environmental/drug interactions,

) to obtain equivalent metabolome coverage using conventionalclassical methods that cover all major groups of metabolites onehas to employ large number of different methods. Combinationof all results obtained in different instruments and by differentpractitioners would be impractical, compromising prospectivefor comprehensive analysis.

. Analytical technologies: current state of the art

Continuous developments in both LC and MS technology haveed to noteworthy advances in separations (e.g. UHPLC) and iney operating MS characteristics such as ionisation methods (e.g.

Please cite this article in press as: H.G. Gika, et al., J. Chromatogr. B (20

ith exciting recent developments in in situ desorption ionisationnd MS imaging [5,6]), sensitivity, mass accuracy, mass resolution,can speed and data acquisition rates, thus improving applicabilitynd utility for metabolic profiling. What is more, developments in

PRESSr. B xxx (2014) xxx–xxx

software have facilitated more efficient use of the wealth of databeing collected by the newer mass spectrometers. Such softwareallow for the automated and programmed implementation of dif-ferent real-time experiments within full scan analyses such asMS–MS/MSn experiments in data dependent acquisition mode,MS/MS with different collision energies (e.g. such as MSE) and soforth. Further to this, the data collected can be effectively scru-tinised by advanced software utilities (often offered as add-onson the basic operating platform) which can effectively combinedifferent levels of data (e.g. accurate mass with isotope ratiomeasurement from both precursor and product ions) to predictfragmentation patterns and reconstruct molecular formulae toachieve the putative identification of unknowns. In GC–MS thepotential for the generation of spectral libraries from electron ion-isation (EI) data has been exploited and a specific metabolomicsdatabase (Agilent Fiehn mass spectral library) has been developedwhich when used in addition to retention indices and existingspectral libraries that contain chemicals and pharmaceuticals, canprovide effective identification of known-unknown peaks (peakannotation).

4. Problems and limitations

Despite such important achievements the current state of the artof LC–MS-based metabolic profiling still meets hindrances not theleast of which is that no single mode of chromatography is yet capa-ble of comprehensively profiling the whole metabolome in a singleanalytical run (e.g., the limitations of RPLC for highly polar com-pounds requires other chromatographic modes to be used such asHILIC as well). Establishing unbiased analytical methods is not triv-ial because the metabolites (expected to be) present in biologicalsamples:

1) can show high diversity of chemical properties,2) can occur over wide concentration ranges (up to eight orders of

magnitude),3) derive from many different sources as they can be endogenous,

exogenous (xenobiotics such as nutrients or pharmaceuticals)or of symbiotic origin, and

4) as a result of the above an “unbiased” quantitative extraction ofall metabolites from their matrix can be considered unrealistic[4 and references included therein].

Further to these general limitations, issues specific to theapplication of hyphenated techniques include:

5) problems in combining data from different MS analyseswhich hinder correlation of data obtained in different instru-ments/laboratories [7]; as a result

6) large scale studies (of epidemiology level) need to overcomethe difficulty of standardisation of methods [8] and the riskof instrumental drifts such as retention time drifts (Fig. 2B)due to deterioration of the analytical column [9] or MS detec-tion instability due the MS ion source contamination. Analytequantification is a problem compared to conventional targetedmethods.

These issues result in7) the fragmentation of research effort, as it becomes clear that

with the current state of the art, profiling work cannot confi-dently be split between different laboratories and that differentresearchers may spend a lot of time identifying the same com-pound in different samples. A major problem is that resultsobtained in untargeted profiling mode in different laboratories

14), http://dx.doi.org/10.1016/j.jchromb.2014.01.054

or even in the same laboratory but in different instruments(e.g. Orbitrap-MS and QTOF-MS) cannot be directly comparedor easily correlated even when the same separation system isemployed [7].

Page 3: LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives

ARTICLE IN PRESSG ModelCHROMB-18766; No. of Pages 6

H.G. Gika et al. / J. Chromatogr. B xxx (2014) xxx–xxx 3

Fig. 2. Utilisation of quality control measures is necessary to assess the system’s stability and suitability. (A) PCA scores plot depicting the stabilisation of the system following5 initial QC injections (system conditioning); QC replications form a tight cloud; analysis by LC–ESI-TOF-MS (unpublished data). (B) PCA scores plot showing a significant timetrend in data due to problematic analysis in LC–TOF-MS. In this example, a problem in the LC system occurred in the analysis of samples 30–31 and affected the subsequenta eptablP lyticad this fi

8

nalyses. (C) Plot of variables in time (X axis, RT in s) and m/z (Y axis). In green accCA scores plot depicting the day-to-day variability in a test of five repeats of an anaiffer to a significant extent in PC2. (For interpretation of the references to colour in

Please cite this article in press as: H.G. Gika, et al., J. Chromatogr. B (20

) a major bottleneck is the identification of candidate markers(metabolite identification or MetID) [10]. Unlike for GC–MSspectra where EI give a characteristic and reproducible “fin-gerprint” of the analysed molecule, LC–MS spectra are highly

e variables (in this case a CV filter of 30% is set), in red unacceptable variables. (D)l sample-set. The three first days cluster together, whereas the fourth and fifth daysgure legend, the reader is referred to the web version of the article.)

14), http://dx.doi.org/10.1016/j.jchromb.2014.01.054

variable: adduct and cluster formation is not controllable or pre-dictable. Fragmentation is not reproducible even in instrumentsof the same type. As a result commercial or freely availableLC–MS spectral libraries still represent an aspirational rather

Page 4: LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives

ING ModelC

4 atog

9

dchcseslqtbmieasaa

ncagoUsacro

5

atiatadiafi

nt

1

ARTICLEHROMB-18766; No. of Pages 6

H.G. Gika et al. / J. Chrom

than a robust solution to the identification of unknowns. Con-sequently the annotation of metabolites or de novo structureelucidation in complex biological matrices typically requires sig-nificant effort that can add up to months or years [10].

) Finally, biochemical pathway analysis and the development ofrelated visualisation tools are still in their infancy.

The lack of standardised protocols from the phase of studyesign, sample collection and handling, up to the phase of chemi-al and statistical analysis remains an issue to be resolved. Protocolarmonisation is needed to facilitate successful large-scale appli-ation of metabolic profiling in life, food/plant or environmentalciences to deliver trustworthy results. However to successfullymbark on epidemiology-scale studies a number of problemshould be resolved: (1) Researchers need to make sure that the ana-ytical system remains stable, and this needs to be expressed in auantitative and reportable manner. (2) If large analytical runs needo be split into batches (a strategy which is in practice unavoidable),atch to batch variation (for an example see Fig. 2D) needs to beinimised and data from these batches needs to be combined [11],

n this aspect implementation of reference or QC samples withinach batch can assist in assessing the quality of the data (Fig. 2And C). (3) Practical interventions such as the clean-up of an ion-ource or the chromatographic system or even the replacement of

chromatographic column need to be studied so that results beforend after this modification can be safely combined.

In targeted analyses these issues are addressed using e.g. inter-al standards. After years of development and debate, clear-cutriteria have been established: FDA guidelines and EC directivesssist the practitioner in the validation of bioanalytical methodolo-ies (e.g. EC Directive 86/469 and 657/2002/EC, periodical issuesf Guidance for Industry Bioanalytical Method Validation from theDA FDA). However similar guidelines do not exist despite efforts

uch as the metabolomics standards initiative [12,13] for holisticnalytical approaches and it is basically unrealistic to correlate,ompare or repeat results reported in the literature. Practically eachesearch group follows their own (if any) route for the validationf methods and results, marker identification and reporting.

. Perspective for the way forward

To solve multi-faceted problems such as those mentionedbove, multidisciplinary and inter-sectoral collaboration is neededhrough the co-operation of analytical chemists, biochemists,nstrument and software developers, informaticians, statisticiansnd life scientists. The wealth of the generated data is the key fea-ure of holistic metabolic profiling, but this same characteristic islso the source for a number of problems. Meticulous experimentalesign and research work are necessary to enable the researcher to

dentify the occurrence of any of the failures mentioned above andvoid the risk of comprehending analytical or chemical bias as anding of biological significance.

In the following paragraphs key issues which in our opinioneed further ellaboration are highligthed and ideas for addressinghem are described:

) Design of experiment (DoE): Although the most recent of themajor omic fields, metabolomics can not be based in the “legacy”of the pioneers of genomic or proteomic research. Metabolomicscan indeed profit from the knowledge and experience obtainedduring the development of the genomic and proteomic fields.

Please cite this article in press as: H.G. Gika, et al., J. Chromatogr. B (20

Perhaps the most important lesson is the high rate of fail-ure of proposed biomarkers [14]. However in their core theseomics disciplines differ for variety of reasons; a major differen-tiating parameter is the nature of the target molecules: genes

PRESSr. B xxx (2014) xxx–xxx

and proteins are practically polymers, generated by repetitionof a small number of units (four bases or 20 amino acids),thus the process of biomarker identification is very different.Another factor is that genomics benefits from the existenceof protocols for signal amplification (based on PCR) whereassimilar amplification is not applicable in metabolics profiling.The fact is that the metabolomics research community needsto address and set-up new rules specific for the design ofappropriate metabolomic experimentation. Designing succesfulexperiments for metabolic profiling should be the job of expertsof the field (i.e. statisticians with experience in metabolomics) inclose collaboration with the analytical scientists. If existing sam-ple sets (not originally collected for metabolomics studies) are tobe analysed, caution is needed to avoid interferences from con-founding factors such as (in the case of samples from humans)lifestyle effects, sub-clinical illness, mistakes in sample collec-tion, enzymatic degradation and so forth.

The selection of controls is another important step that needsto be debated beforehand. Certain logical rules already knownin the field of proteomics should be applied: Controls and testgroup should ideally have the same genetic background, shouldbe matched for age, gender and other factors. If the aim is to pro-file the metabolome of plants, the conditions of growth shouldbe the same; in the case of animal models these should be housedin the same facility. Finally the number of samples to be analysedremains a topic of some debate. Statistical power demands anappropriate number of samples being analysed which if large,may be in contradiction with the real-life practicalities of thestudy, and indeed of the capabilities of the analytical laboratory.

This last problem is common to all omics studies; inevitablya compromise is often reached. On the whole, the time spenton properly designing a metabolomics experiment can neverbe excessive. DoE should aim to cover all possible problems,but also to justify the experiment and prioritise the differentanalytical expeditions available (paragraph 3, Improvement ofmetabolome coverage) essentially in a value to money/timeratio. This can help in the selection of the analytical tools to beused for a specific task.

2) Analytical developments for quality control and validation: Inmetabolic profiling, stability issues may emanate from bothinstrument ends: chromatography (retention time drift, generaldegradation of column performance, etc.) and mass spectrome-try (sensitivity loss due to e.g. contamination in the ion source).In GC–MS the stability of the derivatised samples is limited. Suchissues may result in considerable run-order trends, which maymask biological differentiation and delay biomarker discovery.Efforts are needed to find ways to minimise and avoid suchproblems. Such developments are even more necessary for thesuccessful application of metabolic profiling in large-scale epi-demiological studies where hundreds or thousands of analysesneed to take place. One factor that needs to be considered is thatthese phenomena are also related to the type of specimen ana-lysed. With the current state of the art, some hundreds of urinesample injections can be performed in LC–MS without problems[15]. For blood derived samples (serum/plasma) or tissue thenumber is lower due to the modification of the analytical col-umn from the binding of lipophilic species, indicating the needfor further developments [16]. Similar considerations need to betaken in GC–MS analysis.

Implementation of QC measures (injection of QC (see Fig. 2A)and reference samples) in the analytical sequence can help in therealisation of such endeavours. Explicit criteria for QC accep-

14), http://dx.doi.org/10.1016/j.jchromb.2014.01.054

tance should also be established in data treatment and datamining. It is now well understood that MS-generated data forholistic profiling are very complex and contains a high per-centage of noise. It is well understood that the majority of

Page 5: LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives

ING ModelC

atog

3

4

ARTICLEHROMB-18766; No. of Pages 6

H.G. Gika et al. / J. Chrom

the detected features do not correspond to unique metabolites.Hence application of judicious filters should precede multivari-ate statistical analysis. These filters could enforce cut-off criteriain order to eliminate false signals, noise, ghost peaks or peakswith a systematic trend e.g. like those corresponding to contam-inant built-up on column. Rules such as i.e. filtering with the CV%of peak areas in QC injections [17–19] (Fig. 2C) or the 80% rule[20] effectively reduce the contribution of unstable peaks andeliminate noise from the dataset.

Further to this, differences in the performance of various soft-ware (either commercial or open source algorithms) used fordata extraction should be systematically evaluated. Bespokesoftware, typically developed by instrument vendors as add-on of their MS operating platform offer limited freedom foroptimisation of the data extraction process whilst the user isnot fully aware of the meaning of certain functions or param-eters selected. Alteration of the data treatment method is notreported within the data set or the generated files meant tobe exported as peak tables. Re-analysis of the same sample-setwith different software settings could provide notable changesand this could pass un-noticed. Utilisation of open sourcesoftware allows bigger flexibility but still needs advanced com-puting abilities such as experience in R computing language orMATLAB.

) Improvement of metabolome coverage: After immense researchfrom various groups, several methods and protocols have beenestablished for metabolic profiling using NMR spectroscopy,GC–MS and LC–MS analysis. However these methods do notcover the whole of metabolic space. NMR spectroscopic anal-ysis is limited by sensitivity and can typically detect onlythe most abundant metabolites contained in the analysedsample. GC–MS is limited by the volatility and size of theanalysed molecules. LC–MS is mostly realised in reversed-phase (RP) [4] or hydrophilic interaction chromatography(HILIC) modes [21,22] (whilst detectability and sensitivity arestructure-dependent). Although these LC modes are largelycomplementary, one to the other, there are still numerousimportant metabolites or even metabolic groups which are notpractically analysed by either of them. RPLC on C18 media willcover metabolites of medium to low polarity, but will not effec-tively separate very polar, ionic, and very apolar molecules.Although C18 phases can find use in the analysis of lipids, utilisa-tion of a C30 column may be beneficial for the analysis of certainlipid species e.g. carotenoids and steroids [23]. HILIC can ana-lyse some very polar molecules, but can hardly accommodate allionic species in one injection. It may be necessary to develop twomethods for polar metabolites: one for acidic and one for basicmolecules. In any case the analysis of phosphorylated metabo-lites is very difficult, thus necessitating another analysis modeand ion pair liquid chromatography has been proposed for thistask by several researchers [24–26]. It is thus becoming clear thatcurrently an array of techniques needs to be used in the questfor high metabolome coverage. In such a case a fair number ofmetabolites will appear in more than one mode although mostlikely with different analytical sensitivity. This strategy althoughbeing more comprehensive in terms of metabolome coverageresults in another major issue in current metabolomics which is

) the slow progress of the fusion of data collected on different ana-lytical platforms. For example it is extremely difficult or evenunrealistic to try to combine data obtained with different exper-imental configurations (e.g. HILIC–MS and RPLC–MS; APCI-MSand ESI-MS). The development of advanced data mining tools is

Please cite this article in press as: H.G. Gika, et al., J. Chromatogr. B (20

necessary to embark on such endeavours; however it is advisedthat such efforts start off with high quality data from coherentsample sets and well-planned experiments.

PRESSr. B xxx (2014) xxx–xxx 5

At the same time an important development would be theestablishment of readily available tools that would bind togetherdata from different analytical batches or data obtained in dif-ferent time periods (i.e. several weeks apart). Inter-laboratorystudies, using initially the same and at a later stage differenttypes of instrumentation/software, should be organised in orderto pave the way for such developments which could aim in over-coming research effort fragmentation.

5) Metabolite identification (MetID) remains a major bottleneckin contemporary LC–MS metabolomics. The combination ofhigh resolution MS (HRMS), MS/MS analyses, and advancedinformatics tools are deemed as the most promising tools forMetID. For metabolite annotation (that is the assignment of adetected peak to a known metabolite) candidate lists are gen-erated based on accurate mass detected. However, the processfrom this point is not straightforward as the number of puta-tive identities can be very large and diverse (i.e. 10 or morecandidate molecules per feature) even when using sub 2 ppmmass accuracy. Incorporation of independent (orthogonal) data,such as retention time data, can promote metabolite identifi-cation. In the current practice of metabolic profiling by LC–MSresearchers may rely almost solely on MS data in their efforts toidentify unknown metabolites and this can be regarded inad-equate as the chromatographic properties (such as retentiondata) do not contribute significantly (if they contribute at all)to the MetID process. As retention data can act as a surrogatefor e.g., Log P the lists of putative identities can be reducedby ruling-out candidates by suggesting that a structure(s) isunlikely based on its chromatographic properties. Our grouphas applied retention data prediction algorithms developed inboth RPLC and HILIC profiling of polar metabolites [27,28]. Inthe same line other researchers have exploited retention “pro-jection” as a supplementary means for compound identification[29]. Quadrupole MS was able to identify more compounds thanan FT-ICR-MS when the quadrupole spectra were assisted withretention information. A recent study analysed 120 authen-tic standard metabolites to develop a quantitative structureretention relationship model, incorporating six physicochemi-cal variables in a multiple-linear regression, and exhibited goodpredictive ability for retention times of a range of metabo-lites in HILIC mode [30]. In the end however, identificationis only proven when backed up by demonstration of identi-cal behaviour (chromatographic and spectrometric) with anauthentic standard. Once this is done, those potential biomark-ers have to be validated both by repeating a study and samplesshould be reanalysed using compound-specific validatedmethods.

6) Finally data visualisation remains a considerable hurdle. Evenin the case where metabolites are identified, comprehension ofsuch data it is not trivial within complex metabolic pathwaysand fluxes. A recent trend is seen towards the developmentof biochemical pathway analysis tools to effectively search inthe generated multidimensional data [31], exploiting existingknowledge (e.g. public databases such as KEGG, HMDB [32], lipidmaps) to build associations between the collected signals. Suchinitiatives include Biocyc, Metabolights [33], Reactome, MGIGenome, and MassTrix [31]. Incorporation of data from knownbiochemical pathways may assist in MetID, identifying signals(peaks) that represent missing metabolites in half-covered path-ways or known-unknowns (i.e. peaks that are frequently foundbut are not identified or their MetID is not confirmed). Efficientpathway visualisation can be seen as a drive from signals to

14), http://dx.doi.org/10.1016/j.jchromb.2014.01.054

molecules; from data to information. This in the end may rep-resent a step forward, closer to the perspective of translationalmedicine.

Page 6: LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives

ING ModelC

6 atog

6

tiapm(Cdchm

A

EfLR

R

[[[

[

[[[

[

[

[

[

[

[

[[[

[

[

[

[

[

[[

[

ARTICLEHROMB-18766; No. of Pages 6

H.G. Gika et al. / J. Chrom

. Conclusions

Overall the field of metabolomics has evolved greatly duringhe last decade. The number of active researchers has rapidlyncreased. Large scale initiatives promoted the topic, increasedwareness and provided proof of concept; multi-million Eurosrojects such as the Human Metabolome Database [32], Huser-et [16], COMET [34], the Netherlands Metabolomics Center

http://www.metabolomicscentre.nl/), the UK National Phenomeenter [35] and other projects have put intensive efforts ineveloping new methods and promoting biomarker discovery inertain application fields. Research needs now to address moreorizontal aspects to achieve harmonisation of practices andethods.

cknowledgment

Authors Gika and Theodoridis acknowledge co-financing by theuropean Union (European Social Fund- ESF) and Greek nationalunds through the Operational Program “Education and Lifelongearning” of the National Strategic Reference Framework (NSRF)esearch Funding Program: Thales II.

eferences

[1] J.K. Nicholson, J. Lindon, Nature 455 (2008) 1054.[2] D.B. Kell, Curr. Opin. Microbiol. 7 (2004) 296.[3] M. Baker, Nat. Methods 8 (2011) 117.[4] G. Theodoridis, H. Gika, E. Want, I.D. Wilson, Anal. Chim. Acta 711 (2012) 7.[5] Y.L. Yang, Y. Xu, P. Straight, P.C. Dorrestein, Nat. Chem. Biol. 5 (2009) 885.[6] G. Corso, O. D’Apolito, M. Gelzo, G. Paglia, A. Dello Russo, Bioanalysis 2 (2010)

1883.[7] H.G. Gika, G.A. Theodoridis, M. Earll, R.W. Snyder, S.J. Sumner, I.D. Wilson, Anal.

Chem. 82 (2010) 8226.[8] A.W. Nicholls, Bioanalysis 4 (2012) 2195.[9] E. Zelena, et al., Anal. Chem. 81 (2009) 1357.10] L.C. Goeddel, G.J. Patti, Bioanalysis 4 (2012) 2199.

Please cite this article in press as: H.G. Gika, et al., J. Chromatogr. B (20

11] H.H. Draisma, et al., Anal. Chem. 82 (2010) 1039.12] M.S.I. Board Members, S.A. Sansone, T. Fan, R. Goodacre, J.L. Griffin, N.W. Hardy,

R. Kaddurah-Daouk, B.S. Kristal, J. Lindon, P. Mendes, N. Morrison, B. Nikolau,D. Robertson, L.W. Sumner, C. Taylor, M. van der Werf, B. van Ommen, O. Fiehn,Nat. Biotechnol. 25 (2007) 846.

[

[

PRESSr. B xxx (2014) xxx–xxx

13] O. Fiehn, D. Robertson, J. Griffin, M. van der Werf, B. Nikolau, N. Morrison,L.W. Sumner, R. Goodacre, N.W. Hardy, C. Taylor, Metabolomics 3 (2007)175.

14] J.P.A. Ioannidis, PLoS Med. 2 (2005) e124.15] H.G. Gika, G.A. Theodoridis, et al., Bioanalysis 4 (2012) 2239.16] W.B. Dunn, D. Broadhurst, P. Begley, E. Zelena, S. Francis-McIntyre, N. Anderson,

M. Brown, J.D. Knowles, A. Halsall, J.N. Haselden, A.W. Nicholls, I.D. Wilson, D.B.Kell, R. Goodacre, Nat. Protoc. 6 (2011) 1060.

17] G. Theodoridis, H.G. Gika, I.D. Wilson, Mass Spectrom. Rev. 30 (2011)884.

18] H.G. Gika, G.A. Theodoridis, J.E. Wingate, I.D. Wilson, J. Proteome Res. 6 (2007)3291.

19] H.G. Gika, E. Macpherson, G.A. Theodoridis, I.D. Wilson, J. Chromatogr. B 871(2008) 299.

20] S. Bijlsma, I. Bobeldijk, E.R. Verheij, R. Ramaker, S. Kochhar, I.A. Macdonald, B.van Ommen, A.K. Smilde, Anal. Chem. 78 (2006) 567.

21] K. Spagou, H. Tsoukali, N. Raikos, H. Gika, I.D. Wilson, G. Theodoridis, J. Sep. Sci.33 (2010) 716.

22] H.G. Gika, G.A. Theodoridis, U. Vrhovsek, F. Mattivi, J. Chromatogr. A 1259 (2012)121.

23] B. Gleize, M. Steib, M. André, E. Reboul, Food Chem. 134 (2012) 2560.24] J.M. Buescher, S. Moco, U. Sauer, N. Zamboni, Anal. Chem. 82 (2010) 4403.25] L. Coulier, R. Bas, S. Jespersen, E. Verheij, M.J. van dWerf, T. Hankemeier, Anal.

Chem. 78 (2006) 6573.26] W. Lu, M.F. Clasquin, E. Melamud, D. Amador-Noguez, A.A. Caudy, J.D. Rabi-

nowitz, Anal. Chem. 82 (2010) 3212.27] H. Gika, G. Theodoridis, F. Mattivi, U. Vrhovsek, A. Pappa-Louisi, Anal. Bioanal.

Chem. 404 (2012) 701.28] H. Gika, G. Theodoridis, F. Mattivi, U. Vrhovsek, A. Pappa-Louisi, J. Sep. Sci. 35

(2012) 376.29] P.G. Boswell, J.R. Schellenberg, P.W. Carr, J.D. Cohen, A.D. Hegeman, J. Chro-

matogr. A 1218 (2011) 6732.30] D.J. Creek, A. Jankevics, R. Breitling, D.G. Watson, M.P. Barrett, K.E.V. Burgess,

Anal. Chem. 83 (2011) 8703.31] B. Wägele, M. Witting, P. Schmitt-Kopplin, K. Suhre, PLoS ONE 7 (2012) e39860.32] D.S. Wishart, D. Tzur, C. Knox, R. Eisner, A.C. Guo, N. Young, D. Cheng, K. Jewell,

D. Arndt, S. Sawhney, C. Fung, L. Nikolai, M. Lewis, M.-A. Coutouly, I. Forsythe,P. Tang, S. Shrivastava, K. Jeroncic, P. Stothard, G. Amegbey, D. Block, D.D. Hau,J. Wagner, J. Miniaci, M. Clements, M. Gebremedhin, N. Guo, Y. Zhang, G.E. Dug-gan, G.D. Macinnis, A.M. Weljie, R. Dowlatabadi, F. Bamforth, D. Clive, R. Greiner,L. Li, T. Marrie, B.D. Sykes, H.J. Vogel, L. Querengesser, Nucleic Acids Res. 35(2007) D521.

33] K. Haug, R.M. Salek, P. Conesa, J. Hastings, P. de Matos, M. Rijnbeek, T.Mahendraker, M. Williams, S. Neumann, P. Rocca-Serra, E. Maguire, A.González-Beltrán, S.A. Sansone, J.L. Griffin, C. Steinbeck, Nucleic Acids Res.

14), http://dx.doi.org/10.1016/j.jchromb.2014.01.054

(2013) D781.34] J.C. Lindon, H.C. Keun, T.M. Ebbels, J.M. Pearce, E. Holmes, J.K. Nicholson, Phar-

macogenomics 6 (2005) 691.35] R. Mirnezami, J.M. Kinross, P.A. Vorkas, R. Goldin, E. Holmes, J. Nicholson, A.

Darzi, Ann. Surg. 255 (2012) 881.