期刊
EUROPEAN JOURNAL OF MEDICINAL CHEMISTRY
卷 44, 期 7, 页码 2913-2922出版社
ELSEVIER FRANCE-EDITIONS SCIENTIFIQUES MEDICALES ELSEVIER
DOI: 10.1016/j.ejmech.2008.12.004
关键词
QSAR; Cytochrome 3A4; FA-MLR; PLS; GFA; ANN
资金
- Major Research Grant of University Grant Commission (UGC), New Delhi
Twenty-eight structurally diverse cytochrome 3A4 (CYP3A4) inhibitors have been subjected to quantitative structure-activity relationship (QSAR) studies. The analyses were performed with electronic, spatial, topological, and thermodynamic descriptors calculated using Cerius 2 version 10 software. The statistical tools used were linear [multiple linear regression with factor analysis as preprocessing step (FA-MLR), stepwise MLR, partial least squares (PLS), genetic function algorithm (GFA), genetic PLS (G/PLS)] and non-linear methods [artificial neural network (ANN)I. All the five linear modeling methods indicate the importance of n-octanol/water partition coefficient (log P) along with different topological and electronic parameters. The best model obtained from the training set (stepwise regression) based on highest external predictive R-2 value and lowest RMSEP value also showed good internal predictive power. Other models like FA-MLR, PLS, GFA and G/PLS are also of statistically significant internal and external validation characteristics. The best model [according to r(m)(2) for the test set, as defined by P.P. Roy, K. Roy, QSAR Comb. Sci. 27 (2008) 302-313] obtained from ANN showed a good r(2) value (determination coefficient between observed and predicted values) for the test set compounds, which was superior to those of other statistical models except the stepwise regression derived model. However, based upon the r(m)(2) value (test set), which penalizes a model for large differences between observed and predicted values, the stepwise MLR model was found to be inferior to other methods except PLS. Considering r(m)(2) value for the whole set, the G/PLS derived model appears to be the best predictive model for this data set. For choosing the best predictive model from among comparable models, r(m)(2), for the whole set calculated based on leave-one-out predicted values of the training set and model-derived predicted values for the test set compounds is suggested to be a good criterion. (C) 2008 Elsevier Masson SAS. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据