4.7 Article

Machine-learning assisted molecular formula assignment to high-resolution mass spectrometry data of dissolved organic matter

Journal

TALANTA
Volume 259, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.talanta.2023.124484

Keywords

FT-ICR MS; Orbitrap MS; Molecular formula assignment; Dissolved organic matter

Ask authors/readers for more resources

High-resolution mass spectrometry provides compositional information of dissolved organic matter through isotopic assignment, but multiple possible solutions often occur due to measurement deviation and resolving power limitation. To improve result accuracy in an automated manner, a machine-learning-based algorithm was developed, which showed a significant improvement in formula assignment compared to traditional methods.
High-resolution mass spectrometry (HRMS) provides molecular compositional information of dissolved organic matter (DOM) through isotopic assignment from the molecular mass. However, due to the inevitable deviation of molecular mass measurement and the limitation of resolving power, multiple possible solutions frequently occur for a given molecular mass. Lowering the mass deviation threshold and adding assignment restriction rules are often applied to exclude the incorrect solutions, which generally involves time-consuming manual postprocessing of mass data. To improve the result accuracy in an automated manner, we developed a molecular formula assignment algorithm based on machine-learning technology. The method integrated a logistic regression model using manually corrected isotopic composition and the peak features of HRMS data (m/z, signal-tonoise ratio, isotope type, and number, etc.) as training data. The developed model can evaluate the correctness of a candidate formula for the given mass peak based on the peak features. The method was verified by various DOM samples FT-ICR MS data (direct infusion negative mode electrospray), achieving a similar to 90% accuracy (compared to the traditional approach) for formula assignment. The method was applied to a series of NOM samples and showed a significant improvement in formula assignment compared with the mass matching method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available