4.8 Article

Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets

期刊

ANALYTICAL CHEMISTRY
卷 94, 期 14, 页码 5493-5503

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.1c03592

关键词

-

资金

  1. Health Data Research UK - UK Medical Research Council
  2. Engineering and Physical Sciences Research Council
  3. Economic and Social Research Council
  4. Department of Health and Social Care (England)
  5. Chief Scientist Office of the Scottish Government Health and Social Care Directorates
  6. Health and Social Care Research and Development Division (Welsh Government)
  7. Public Health Agency (Northern Ireland)
  8. British Heart Foundation
  9. Wellcome Trust
  10. NIH/NHLBI [R01HL111362, R01HL133932, R01HL149779, 75N92020D00007]
  11. EU COMBI-BIO project (FP7) [305422]
  12. EU PhenoMeNal project (H2020) [654241]
  13. National Institutes of Health [R01HL111362, R01HL133932]
  14. Medical Research Council
  15. National Institute for Health Research [MC_PC_12025]
  16. BBSRC [BB/T007974/1]
  17. MRC Centre for Environment and Health [MR/S019669/1]
  18. UK Dementia Research Institute [MC_PC_17114]
  19. Alzheimer's Society
  20. Alzheimer's Research UK
  21. Home Office [780-TETRA]
  22. Medical Research Council [MR/R023484/1, MR/L01632X/1]
  23. Economic and Social Research Council [MR/R023484/1]
  24. National Heart, Lung, and Blood Institute (NHLBI)
  25. Erasmus Medical Center and Erasmus University, Rotterdam
  26. Netherlands Organization for the Health Research and Development (ZonMw)
  27. Research Institute for Diseases in the Elderly (RIDE)
  28. Ministry of Education, Culture and Science
  29. Ministry for Health, Welfare and Sports
  30. European Commission (DG XII)
  31. Municipality of Rotterdam
  32. [75N92020D00001]
  33. [HHSN268201500003I]
  34. [N01-HC-95159]
  35. [75N92020D00005]
  36. [N01-HC-95160]
  37. [75N92020D00002]
  38. [N01-HC-95161]
  39. [75N92020D00003]
  40. [N01-HC-95162]
  41. [75N92020D00006]
  42. [N01-HC-95163]
  43. [75N92020D00004]
  44. [N01-HC-95164]
  45. [N01-HC-95165]
  46. [N01-HC-95166]
  47. [N01-HC-95167]
  48. [N01-HC-95168]
  49. [N01-HC-95169]
  50. [UL1-TR-000040]
  51. [UL1-TR-001079]
  52. [UL1-TR-001420]
  53. [UL1-TR-001881]
  54. [DK063491]
  55. BBSRC [BB/T007974/1] Funding Source: UKRI

向作者/读者索取更多资源

This article introduces a method to find feature correspondence between two similar LC-MS metabolomics experiments or batches using only the features' retention time, mass-to-charge ratio, and feature intensity. The effectiveness of the method is demonstrated through experiments on real and synthetic datasets.
Integration of multiple datasets can greatly enhance bioanalytical studies, for example, by increasing power to discover and validate biomarkers. In liquid chromatography-mass spectrometry (LC-MS) metabolomics, it is especially hard to combine untargeted datasets since the majority of metabolomic features are not annotated and thus cannot be matched by chemical identity. Typically, the information available for each feature is retention time (RT), mass-to-charge ratio (m/z), and feature intensity (FI). Pairs of features from the same metabolite in separate datasets can exhibit small but significant differences, making matching very challenging. Current methods to address this issue are too simple or rely on assumptions that cannot be met in all cases. We present a method to find feature correspondence between two similar LC-MS metabolomics experiments or batches using only the features' RT, m/z, and FI. We demonstrate the method on both real and synthetic datasets, using six orthogonal validation strategies to gauge the matching quality. In our main example, 4953 features were uniquely matched, of which 585 (96.8%) of 604 manually annotated features were correct. In a second example, 2324 features could be uniquely matched, with 79 (90.8%) out of 87 annotated features correctly matched. Most of the missed annotated matches are between features that behave very differently from modeled inter-dataset shifts of RT, MZ, and FI. In a third example with simulated data with 4755 features per dataset, 99.6% of the matches were correct. Finally, the results of matching three other dataset pairs using our method are compared with a published alternative method, metabCombiner, showing the advantages of our approach. The method can be applied using M2S (Match 2 Sets), a free, open-source MATLAB toolbox, available at https://github.com/rjdossan/M2S.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据