4.8 Article

Computational Variation: An Underinvestigated Quantitative Variability Caused by Automated Data Processing in Untargeted Metabolomics

期刊

ANALYTICAL CHEMISTRY
卷 93, 期 25, 页码 8719-8728

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.0c03381

关键词

-

资金

  1. University of British Columbia Start-up Grant [F18-03001]
  2. Canada Foundation for Innovation [CFI 38159]
  3. UBC [F19-05720]
  4. New Frontiers in Research Fund/Exploration [NFRFE-2019-00789]
  5. National Science and Engineering Research Council (NSERC) [RGPIN-2020-04895]
  6. NSERC Discovery Launch Supplement [DGECR-2020-00189]

向作者/读者索取更多资源

This study explores how experimental factors contribute to computational variation in untargeted metabolomics, proposing a quality control sample-based correction workflow to minimize this variation. The bioinformatic solution is successfully demonstrated in a metabolomic comparison of leukemia patients before and after chemotherapy, showing significant changes in metabolic features.
Computational tools are commonly used in untargeted metabolomics to automatically extract metabolic features from liquid chromatography-mass spectrometry (LC-MS) raw data. However, due to the incapability of software to accurately determine chromatographic peak heights/areas for features with poor chromatographic peak shape, automated data processing in untargeted metabolomics faces additional quantitative variation (i.e., computational variation) besides the well-recognized analytical and biological variations. In this work, using multiple biological samples, we investigated how experimental factors, including sample concentrations, LC separation columns, and data processing programs, contribute to computational variation. For example, we found that the peak height (PH)-based quantification is more precise when MS-DIAL was used for data processing. We further systematically compared the different patterns of computational variation between PH- and peak area (PA)-based quantitative measurements. Our results suggest that the magnitude of computational variation is highly consistent at a given concentration. Hence, we proposed a quality control (QC) sample-based correction workflow to minimize computational variation by automatically selecting PH or PA-based measurement for each intensity value. This bioinformatic solution was demonstrated in a metabolomic comparison of leukemia patients before and after chemotherapy. Our novel workflow can be effectively applied on 652 out of 915 metabolic features, and over 31% (206 out of 652) of corrected features showed distinctly changed statistical significance. Overall, this work highlights computational variation, a considerable but underinvestigated quantitative variability in omics-scale quantitative analyses. In addition, the proposed bioinformatic solution can minimize computational variation, thus providing a more confident statistical comparison among biological groups in quantitative metabolomics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据