4.7 Article

Comparative chemometric modeling of cytochrome 3A4 inhibitory activity of structurally diverse compounds using stepwise MLR, FA-MLR, PLS, GFA, G/PLS and ANN techniques

期刊

EUROPEAN JOURNAL OF MEDICINAL CHEMISTRY
卷 44, 期 7, 页码 2913-2922

出版社

ELSEVIER FRANCE-EDITIONS SCIENTIFIQUES MEDICALES ELSEVIER
DOI: 10.1016/j.ejmech.2008.12.004

关键词

QSAR; Cytochrome 3A4; FA-MLR; PLS; GFA; ANN

资金

  1. Major Research Grant of University Grant Commission (UGC), New Delhi

向作者/读者索取更多资源

Twenty-eight structurally diverse cytochrome 3A4 (CYP3A4) inhibitors have been subjected to quantitative structure-activity relationship (QSAR) studies. The analyses were performed with electronic, spatial, topological, and thermodynamic descriptors calculated using Cerius 2 version 10 software. The statistical tools used were linear [multiple linear regression with factor analysis as preprocessing step (FA-MLR), stepwise MLR, partial least squares (PLS), genetic function algorithm (GFA), genetic PLS (G/PLS)] and non-linear methods [artificial neural network (ANN)I. All the five linear modeling methods indicate the importance of n-octanol/water partition coefficient (log P) along with different topological and electronic parameters. The best model obtained from the training set (stepwise regression) based on highest external predictive R-2 value and lowest RMSEP value also showed good internal predictive power. Other models like FA-MLR, PLS, GFA and G/PLS are also of statistically significant internal and external validation characteristics. The best model [according to r(m)(2) for the test set, as defined by P.P. Roy, K. Roy, QSAR Comb. Sci. 27 (2008) 302-313] obtained from ANN showed a good r(2) value (determination coefficient between observed and predicted values) for the test set compounds, which was superior to those of other statistical models except the stepwise regression derived model. However, based upon the r(m)(2) value (test set), which penalizes a model for large differences between observed and predicted values, the stepwise MLR model was found to be inferior to other methods except PLS. Considering r(m)(2) value for the whole set, the G/PLS derived model appears to be the best predictive model for this data set. For choosing the best predictive model from among comparable models, r(m)(2), for the whole set calculated based on leave-one-out predicted values of the training set and model-derived predicted values for the test set compounds is suggested to be a good criterion. (C) 2008 Elsevier Masson SAS. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据