4.7 Review

Advancements in capturing and mining mass spectrometry data are transforming natural products research

Journal

NATURAL PRODUCT REPORTS
Volume 38, Issue 11, Pages 2066-2082

Publisher

ROYAL SOC CHEMISTRY
DOI: 10.1039/d1np00040c

Keywords

-

Funding

  1. Danish National Research Foundation [DNRF137]
  2. NIH [GM107550]
  3. ASDI eScience grant from the Netherlands eScience Center [ASDI.2017.030]
  4. Intramural Research Program of National Institute of Environmental Health Sciences of the NIH [ES103363-01]

Ask authors/readers for more resources

This article discusses the importance of mass spectrometry technology in natural products research and the new trend of open MS data and data mining tools in this field. Over the past 5 years, this shift has rapidly developed with huge potential for the future. The article proposes a new framework and challenges for utilizing repository data, highlighting the importance of data openness and data mining as the next important development stage in natural products research.
Covering: 2016 up to 2021 Mass spectrometry (MS) is an essential technology in natural products research with MS fragmentation (MS/MS) approaches becoming a key tool. Recent advancements in MS yield dense metabolomics datasets which have been, conventionally, used by individual labs for individual projects; however, a shift is brewing. The movement towards open MS data (and other structural characterization data) and accessible data mining tools is emerging in natural products research. Over the past 5 years, this movement has rapidly expanded and evolved with no slowdown in sight; the capabilities of today vastly exceed those of 5 years ago. Herein, we address the analysis of individual datasets, a situation we are calling the '2021 status quo', and the emergent framework to systematically capture sample information (metadata) and perform repository-scale analyses. We evaluate public data deposition, discuss the challenges of working in the repository scale, highlight the challenges of metadata capture and provide illustrative examples of the power of utilizing repository data and the tools that enable it. We conclude that the advancements in MS data collection must be met with advancements in how we utilize data; therefore, we argue that open data and data mining is the next evolution in obtaining the maximum potential in natural products research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available