4.6 Article

Normalizing and Correcting Variable and Complex LC-MS Metabolomic Data with the R Package pseudoDrift

期刊

METABOLITES
卷 12, 期 5, 页码 -

出版社

MDPI
DOI: 10.3390/metabo12050435

关键词

maize; metabolomics; LC-MS; signal drift; data normalization

资金

  1. National Science Foundation [1733633]
  2. University ofWisconsin-Madison SciMed GRS fellowship
  3. Graduate School, Office of Vice Chancellor for Research and Graduate Education at the University ofWisconsin-Madison
  4. Wisconsin Alumni Research Foundation
  5. Michigan State University under the Training Program in Plant Biotechnology for Health and Sustainability [T32-GM110523]
  6. Direct For Biological Sciences
  7. Division Of Integrative Organismal Systems [1733633] Funding Source: National Science Foundation

向作者/读者索取更多资源

This article discusses the importance of liquid chromatography-mass spectroscopy (LC-MS) in biological research and the challenges posed by technical errors. A new approach called pseudoDrift is proposed to detect and correct these errors, with demonstrated effectiveness and flexibility through data simulation. The analysis of a phenolic compound dataset from non-transgenic maize inbred lines showcases the application of this method in studying the dynamics of specialized metabolism in plants.
In biological research domains, liquid chromatography-mass spectroscopy (LC-MS) has prevailed as the preferred technique for generating high quality metabolomic data. However, even with advanced instrumentation and established data acquisition protocols, technical errors are still routinely encountered and can pose a significant challenge to unveiling biologically relevant information. In large-scale studies, signal drift and batch effects are how technical errors are most commonly manifested. We developed pseudoDrift, an R package with capabilities for data simulation and outlier detection, and a new training and testing approach that is implemented to capture and to optionally correct for technical errors in LC-MS metabolomic data. Using data simulation, we demonstrate here that our approach performs equally as well as existing methods and offers increased flexibility to the researcher. As part of our study, we generated a targeted LC-MS dataset that profiled 33 phenolic compounds from seedling stem tissue in 602 genetically diverse non-transgenic maize inbred lines. This dataset provides a unique opportunity to investigate the dynamics of specialized metabolism in plants.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据