4.7 Article

Deep Learning Based Prediction of Gas Chromatographic Retention Indices for a Wide Variety of Polar and Mid-Polar Liquid Stationary Phases

Journal

Publisher

MDPI
DOI: 10.3390/ijms22179194

Keywords

gas chromatography; deep learning; untargeted analysis; retention index; QSPR; QSRR

Funding

  1. Ministry of Science and Higher Education of the Russian Federation

Ask authors/readers for more resources

The use of deep learning in predicting retention indices has significantly improved accuracy for non-polar stationary phases. This study demonstrates the first application of deep learning for retention index prediction on polar and mid-polar stationary phases, achieving high accuracy. Furthermore, the approach can be directly applied to predict the second dimension retention times if a large enough dataset is available.
Prediction of gas chromatographic retention indices based on compound structure is an important task for analytical chemistry. The predicted retention indices can be used as a reference in a mass spectrometry library search despite the fact that their accuracy is worse in comparison with the experimental reference ones. In the last few years, deep learning was applied for this task. The use of deep learning drastically improved the accuracy of retention index prediction for non-polar stationary phases. In this work, we demonstrate for the first time the use of deep learning for retention index prediction on polar (e.g., polyethylene glycol, DB-WAX) and mid-polar (e.g., DB-624, DB-210, DB-1701, OV-17) stationary phases. The achieved accuracy lies in the range of 16-50 in terms of the mean absolute error for several stationary phases and test data sets. We also demonstrate that our approach can be directly applied to the prediction of the second dimension retention times (GC x GC) if a large enough data set is available. The achieved accuracy is considerably better compared with the previous results obtained using linear quantitative structure-retention relationships and ACD ChromGenius software. The source code and pre-trained models are available online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available