4.8 Article

CFM-ID 4.0: More Accurate ESI-MS/MS Spectral Prediction and Compound Identification

Journal

ANALYTICAL CHEMISTRY
Volume 93, Issue 34, Pages 11692-11700

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.1c01465

Keywords

-

Funding

  1. Genome Canada
  2. Genome British Columbia
  3. Genome Alberta [284MBO, 264PRO]
  4. Canada Foundation for Innovation
  5. National Institutes of Health
  6. National Institute of Environmental Health Sciences grant [U2CES030170]
  7. Estonian Research Council grant [PUTJD903]
  8. Natural Sciences and Engineering Research Council
  9. Canadian Institutes of Health Research
  10. Canadian Institute for Advanced Research
  11. Alberta Machine Intelligence Institute

Ask authors/readers for more resources

Mass spectrometry (MS) is commonly used in metabolomics for identifying and annotating metabolites. CFM-ID is a computer program that accurately predicts EST-MS/MS spectrum for a given compound structure, improving compound-to-mass-spectrum prediction and in silico mass-spectrum-to-compound identification. This new version of CFM-ID (version 4.0) demonstrates significantly increased accuracy in both ET-MS/MS spectral prediction and compound identification.
In the field of metabolomics, mass spectrometry (MS) is the method most commonly used for identifying and annotating metabolites. As this typically involves matching a given MS spectrum against an experimentally acquired reference spectral library, this approach is limited by the coverage and size of such libraries (which typically number in the thousands). These experimental libraries can be greatly extended by predicting the MS spectra of known chemical structures (which number in the millions) to create computational reference spectral libraries. To facilitate the generation of predicted spectral reference libraries, we developed CFM-ID, a computer program that can accurately predict EST-MS/MS spectrum for a given compound structure. CFM-ID is one of the best-performing methods for compound-to-mass-spectrum prediction and also one of the top tools for in silico mass-spectrum-to-compound identification. This work improves CFM-ID's ability to predict EST-MS/MS spectra from compounds by (1) learning parameters from features based on the molecular topology, (2) adding a new approach to ring cleavage that models such cleavage as a sequence of simple chemical bond dissociations, and (3) expanding its hand-written rule-based predictor to cover more chemical classes, including acylcarnitines, acylcholines, flavonols, flavones, flavanones, and flavonoid glycosides. We demonstrate that this new version of CFM-ID (version 4.0) is significantly more accurate than previous CFM-ID versions in terms of both ET-MS/MS spectral prediction and compound identification. CFM-ID 4.0 is available at http://cfmid4.wishartlab.com/ as a web server and docker images can be downloaded at https://hub.docker.com/r/wishartlab/cfmid.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available