4.6 Article

Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis

期刊

PLOS COMPUTATIONAL BIOLOGY
卷 17, 期 9, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1009105

关键词

-

资金

  1. Wellcome Trust [222837/Z/21/Z]
  2. UK Medical Research Council [MR/R008922/1]
  3. French Ministry of Research [ANR-INBS-0010, ANR-19-CE45-0021, DFG: 431572533]
  4. National Research Agency, French MetaboHUB [ANR-INBS-0010, ANR-19-CE45-0021, DFG: 431572533]
  5. BBSRC [BB/T007974/1]
  6. NIH [R01 HL133932-01]
  7. NIHR Imperial Biomedical Research Centre (BRC)
  8. MESRI (Minister of Higher Education, Research and Innovation)
  9. Agence Nationale de la Recherche (ANR) [ANR-19-CE45-0021] Funding Source: Agence Nationale de la Recherche (ANR)
  10. BBSRC [BB/T007974/1] Funding Source: UKRI
  11. MRC [MR/R008922/1] Funding Source: UKRI
  12. Wellcome Trust [222837/Z/21/Z] Funding Source: Wellcome Trust

向作者/读者索取更多资源

The study found that changes in different parameters (such as background set, differential metabolite selection methods, and pathway databases) can significantly alter the results of over-representation analysis in metabolomics, which has practical implications for users and researchers. The study offers some recommendations to help ensure the reliability and reproducibility of pathway analysis in metabolomics.
Over-representation analysis (ORA) is one of the commonest pathway analysis approaches used for the functional interpretation of metabolomics datasets. Despite the widespread use of ORA in metabolomics, the community lacks guidelines detailing its best-practice use. Many factors have a pronounced impact on the results, but to date their effects have received little systematic attention. Using five publicly available datasets, we demonstrated that changes in parameters such as the background set, differential metabolite selection methods, and pathway database used can result in profoundly different ORA results. The use of a non-assay-specific background set, for example, resulted in large numbers of false-positive pathways. Pathway database choice, evaluated using three of the most popular metabolic pathway databases (KEGG, Reactome, and BioCyc), led to vastly different results in both the number and function of significantly enriched pathways. Factors that are specific to metabolomics data, such as the reliability of compound identification and the chemical bias of different analytical platforms also impacted ORA results. Simulated metabolite misidentification rates as low as 4% resulted in both gain of false-positive pathways and loss of truly significant pathways across all datasets. Our results have several practical implications for ORA users, as well as those using alternative pathway analysis methods. We offer a set of recommendations for the use of ORA in metabolomics, alongside a set of minimal reporting guidelines, as a first step towards the standardisation of pathway analysis in metabolomics. Author summary Metabolomics is a rapidly growing field of study involving the profiling of small molecules within an organism. It allows researchers to understand the effects of biological status (such as health or disease) on cellular biochemistry, and has wide-ranging applications, from biomarker discovery and personalised medicine in healthcare to crop protection and food security in agriculture. Pathway analysis helps to understand which biological pathways, representing collections of molecules performing a particular function, may be involved in response to a disease phenotype, or drug treatment, for example. Over-representation analysis (ORA) is perhaps the most common pathway analysis method used in the metabolomics community. However, ORA can give drastically different results depending on the input data and parameters used. Here, we have established the effects of these factors on ORA results using computational modifications applied to five real-world datasets. Based on our results, we offer the research community a set of best-practice recommendations applicable not only to ORA but also to other pathway analysis methods to help ensure the reliability and reproducibility of results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据