4.5 Article

Three-class classification models of logS and logP derived by using GA-CG-SVM approach

Journal

MOLECULAR DIVERSITY
Volume 13, Issue 2, Pages 261-268

Publisher

SPRINGER
DOI: 10.1007/s11030-009-9108-1

Keywords

Aqueous solubility; Lipophilicity; Feature selection; Parameter optimization; Support vector machine (SVM)

Funding

  1. 863 Hi-Tech Program [2006AA020402]
  2. National Natural Science Foundation of China [30772651, 20872100]
  3. Youth Foundation of Sichuan Province [08ZQ026-030]

In this investigation, three-class classification models of aqueous solubility (logS) and lipophilicity (logP) were developed using a support vector machine (SVM) combined with a genetic algorithm (GA) for feature selection and a conjugate gradient (CG) method for parameter optimization. The SVM classification models were evaluated by 5-fold cross-validation and on an independent test set. For logS, the overall prediction accuracy is 87.1% for the training set and 90.0% for the test set. For logP, the overall prediction accuracy is 81.0% for the training set and 82.0% for the test set. In general, for both logS and logP, the prediction accuracies of the three-class models are a few percent lower than those of two-class models. A comparison of the GA-CG-SVM models with GA-SVM models shows that SVM parameter optimization has a significant impact on the quality of the SVM classification model.
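The feature-selection step described in the abstract can be illustrated with a minimal sketch: a genetic algorithm evolves binary feature masks, and each mask is scored by 5-fold cross-validated accuracy on a three-class problem. Everything concrete here is an assumption made for illustration, not the authors' implementation: the data are synthetic, a nearest-centroid classifier stands in for the SVM so the example needs only the standard library, the GA settings (population size, mutation rate, truncation selection) are arbitrary, and the CG parameter-optimization step is omitted. All function names are hypothetical.

```python
import random

def centroid_fit(X, y):
    """Mean feature vector per class (a simple stand-in for an SVM)."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, v in enumerate(row):
            acc[j] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}

def centroid_predict(model, row):
    """Label of the closest class centroid (squared Euclidean distance)."""
    return min(model, key=lambda c: sum((a - b) ** 2 for a, b in zip(row, model[c])))

def cv_accuracy(X, y, mask, k=5):
    """k-fold cross-validated accuracy using only the features where mask is 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature set cannot classify anything
    Xs = [[row[j] for j in cols] for row in X]
    correct = 0
    for fold in range(k):
        train = [i for i in range(len(Xs)) if i % k != fold]
        test = [i for i in range(len(Xs)) if i % k == fold]
        model = centroid_fit([Xs[i] for i in train], [y[i] for i in train])
        correct += sum(centroid_predict(model, Xs[i]) == y[i] for i in test)
    return correct / len(Xs)

def ga_select(X, y, n_features, pop_size=20, generations=15, p_mut=0.1, seed=0):
    """Evolve binary feature masks; fitness is cross-validated accuracy."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size)]

    def fitness(mask):
        return cv_accuracy(X, y, mask)

    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]            # truncation selection, keeps the elites
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)    # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ (rng.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = parents + children
    best = max(pop, key=fitness)
    return best, fitness(best)

def make_data(n=150, seed=1):
    """Synthetic three-class data: 2 informative features followed by 4 noise features."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 3
        informative = [label * 3.0 + rng.gauss(0, 0.5), -label * 3.0 + rng.gauss(0, 0.5)]
        noise = [rng.gauss(0, 5.0) for _ in range(4)]
        X.append(informative + noise)
        y.append(label)
    return X, y

X, y = make_data()
mask, acc = ga_select(X, y, n_features=6)
print("selected mask:", mask, "cross-validated accuracy:", round(acc, 3))
```

In a fuller GA-CG-SVM setup, the fitness function would train an SVM on the selected descriptors, and a separate optimizer would tune the SVM parameters; the wrapper structure (mask, cross-validated score, selection, crossover, mutation) would stay the same.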