期刊
TALANTA
卷 259, 期 -, 页码 -出版社
ELSEVIER
DOI: 10.1016/j.talanta.2023.124484
关键词
FT-ICR MS; Orbitrap MS; Molecular formula assignment; Dissolved organic matter
High-resolution mass spectrometry provides compositional information of dissolved organic matter through isotopic assignment, but multiple possible solutions often occur due to measurement deviation and resolving power limitation. To improve result accuracy in an automated manner, a machine-learning-based algorithm was developed, which showed a significant improvement in formula assignment compared to traditional methods.
High-resolution mass spectrometry (HRMS) provides molecular compositional information of dissolved organic matter (DOM) through isotopic assignment from the molecular mass. However, due to the inevitable deviation of molecular mass measurement and the limitation of resolving power, multiple possible solutions frequently occur for a given molecular mass. Lowering the mass deviation threshold and adding assignment restriction rules are often applied to exclude the incorrect solutions, which generally involves time-consuming manual postprocessing of mass data. To improve the result accuracy in an automated manner, we developed a molecular formula assignment algorithm based on machine-learning technology. The method integrated a logistic regression model using manually corrected isotopic composition and the peak features of HRMS data (m/z, signal-tonoise ratio, isotope type, and number, etc.) as training data. The developed model can evaluate the correctness of a candidate formula for the given mass peak based on the peak features. The method was verified by various DOM samples FT-ICR MS data (direct infusion negative mode electrospray), achieving a similar to 90% accuracy (compared to the traditional approach) for formula assignment. The method was applied to a series of NOM samples and showed a significant improvement in formula assignment compared with the mass matching method.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据