4.7 Article

Artificial neural network prediction of multilinear gradient retention in reversed-phase HPLC: comprehensive QSRR-based models combining categorical or structural solute descriptors and gradient profile parameters

Journal

ANALYTICAL AND BIOANALYTICAL CHEMISTRY
Volume 407, Issue 4, Pages 1181-1190

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s00216-014-8317-3

Keywords

Reversed-phase liquid chromatography; Multilinear gradient elution; Retention prediction; Artificial neural network; Quantitative structure-retention relationship

Ask authors/readers for more resources

A multilayer artificial neural network (ANN) is used to model the reversed-phase liquid chromatography retention times of 16 selected compounds, including purines, pyrimidines and nucleosides. The analysed data, taken from literature, were collected in acetonitrile-water eluents under the application of 16 different multilinear gradients. The parameters describing the gradient profile together with solute descriptors are considered as the independent variables of an ANN-based model providing the retention time as response. Categorical variables or, alternatively, a selected set of molecular descriptors of computational origin are adopted to represent the solutes. Network training, validation and testing are performed preliminarily using data of 12, 2 and 4 gradients, respectively and successively, to investigate model performance under more severe calibration conditions, with data of 9, 2 and 7 gradients. The proposed approach allows a quite accurate prediction of retention times of the target analytes in external multilinear gradients. Categorical variables can successfully represent the target solutes when the model is called to transfer retention data from calibration to external gradients. In particular, using a five-dimensional bit string to represent the analytes, mean errors on retention times are 2 and 3 % under the most and less favourable calibration conditions, respectively. A comparable performance is observed if the categorical variables are replaced by five molecular descriptors, selected by a genetic algorithm within a large set of structural variables of computational origin.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available