Journal
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II
Volume 13394, Issue -, Pages 289-301
Publisher
SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-031-13829-4_24
Keywords
Tannins; QSAR; Quantum semi-empirical descriptors; Feature selection; MLR; ANN
Funding
- Provincial Science and Technology Grant of Shanxi Province [20210302124588]
- Science and technology innovation project of Shanxi province universities [2019L0683]
This paper presents a tannin-based QSAR and machine learning pipeline, which uses quantum semi-empirical descriptors and feature selection with a nonlinear artificial neural network. The model achieved good performance in predicting the antioxidant activity of tannins, providing guidance for tannin-based therapeutic design in the future.
Tannins are potential therapeutic agents as well as effective antioxidants. Here, a tannin-based QSAR workflow with a machine learning pipeline is described. IC50 values for the antioxidant activity of tannins were adapted from the literature and split into training and test datasets. Quantum semi-empirical descriptors were then computed. Out of 277 chemical descriptors, 17 were shortlisted by feature selection. With multiple linear regression (MLR) on the test dataset, R² = 0.706 and mean absolute error (MAE) = 1.94. With a nonlinear artificial neural network (ANN) on the same dataset, R² = 0.858 and MAE = 1.02. Therefore, AMPAC-CODESSA's feature selection combined with an ANN provides an effective tannin QSAR model that can aid future tannin-based therapeutic design.
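The two test-set metrics quoted in the abstract, R² and MAE, can be illustrated with a minimal Python sketch. It assumes R² is the coefficient of determination (1 - SS_res / SS_tot); the abstract does not say which definition the authors used. The function names and the `observed` / `predicted` values are hypothetical and are not data from the paper.

```python
from typing import Sequence


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average absolute difference between observed and predicted values."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination, 1 - SS_res / SS_tot (assumed definition)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    # Residual sum of squares: error left after the model's predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: spread of the observed values around their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all observed values are equal")
    return 1.0 - ss_res / ss_tot


# Hypothetical values for illustration only; NOT data from the paper.
observed = [2.0, 4.0, 6.0, 8.0]
predicted = [2.5, 3.5, 6.5, 7.5]

if __name__ == "__main__":
    print("MAE:", mean_absolute_error(observed, predicted))
    print("R^2:", r_squared(observed, predicted))
```

Under this definition, a higher R² together with a lower MAE on the same test set, as reported for the ANN relative to MLR, indicates both a better fit to the variance and smaller average errors.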