4.4 Article

Multivariate strategy for the sample selection and integration of multi-batch data in metabolomics

Journal

METABOLOMICS
Volume 13, Issue 10, Pages -

Publisher

SPRINGER
DOI: 10.1007/s11306-017-1248-1

Keywords

OPLS; Metabolomics; Multi-batch analysis; Representative sample selection

Funding

  1. AstraZeneca-Karolinska Institute Joint Research Program in Translational Science
  2. Apotekare Hedbergs Foundation
  3. Sigurd and Elsa Goljes Memorial Fund
  4. Swedish Research Council
  5. Swedish Rheumatism Association, King Gustaf V's 80-year Foundation
  6. Stockholm County Council
  7. Karolinska Institute Foundation
  8. Swedish Heart-Lung Foundation

Ask authors/readers for more resources

Introduction Availability of large cohorts of samples with related metadata provides scientists with extensive material for studies. At the same time, recent development of modern high-throughput 'omics' technologies, including metabolomics, has resulted in the potential for analysis of large sample sizes. Representative subset selection becomes critical for selection of samples from bigger cohorts and their division into analytical batches. This especially holds true when relative quantification of compound levels is used. Objectives We present a multivariate strategy for representative sample selection and integration of results from multi-batch experiments in metabolomics. Methods Multivariate characterization was applied for design of experiment based sample selection and subsequent subdivision into four analytical batches which were analyzed on different days by metabolomics profiling using gas-chromatography time-of-flight mass spectrometry (GC-TOFMS). For each batch OPLS-DA (R) was used and its p(corr) vectors were averaged to obtain combined metabolic profile. Jackknifed standard errors were used to calculate confidence intervals for each metabolite in the average p(corr) profile. Results A combined, representative metabolic profile describing differences between systemic lupus erythematosus (SLE) patients and controls was obtained and used for elucidation of metabolic pathways that could be disturbed in SLE. Conclusion Design of experiment based representative sample selection ensured diversity and minimized bias that could be introduced at this step. Combined metabolic profile enabled unified analysis and interpretation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available