4.4 Article

PeakForest: a multi-platform digital infrastructure for interoperable metabolite spectral data and metadata management

Journal

METABOLOMICS
Volume 18, Issue 6, Pages -

Publisher

SPRINGER
DOI: 10.1007/s11306-022-01899-3

Keywords

Curation; Database; FAIR; Interoperability; Metabolite identification; Spectral library

Funding

  1. French National Facility in Metabolomics & Fluxomics, MetaboHUB [11-INBS-0010]
  2. French ANR

Ask authors/readers for more resources

A comprehensive resource, PeakForest addresses the technical bottleneck in annotating large-scale spectral data and identifying metabolites in metabolomics laboratories. It provides a structured database with tools for data curation and browsing, facilitating the sharing and integration of spectral data across laboratories.
Introduction Accuracy of feature annotation and metabolite identification in biological samples is a key element in metabolomics research. However, the annotation process is often hampered by the lack of spectral reference data in experimental conditions, as well as logistical difficulties in the spectral data management and exchange of annotations between laboratories. Objectives To design an open-source infrastructure allowing hosting both nuclear magnetic resonance (NMR) and mass spectra (MS), with an ergonomic Web interface and Web services to support metabolite annotation and laboratory data management. Methods We developed the PeakForest infrastructure, an open-source Java tool with automatic programming interfaces that can be deployed locally to organize spectral data for metabolome annotation in laboratories. Standardized operating procedures and formats were included to ensure data quality and interoperability, in line with international recommendations and FAIR principles. Results PeakForest is able to capture and store experimental spectral MS and NMR metadata as well as collect and display signal annotations. This modular system provides a structured database with inbuilt tools to curate information, browse and reuse spectral information in data treatment. PeakForest offers data formalization and centralization at the laboratory level, facilitating shared spectral data across laboratories and integration into public databases. Conclusion PeakForest is a comprehensive resource which addresses a technical bottleneck, namely large-scale spectral data annotation and metabolite identification for metabolomics laboratories with multiple instruments. PeakForest databases can be used in conjunction with bespoke data analysis pipelines in the Galaxy environment, offering the opportunity to meet the evolving needs of metabolomics research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available