4.6 Article

Bioactive Molecule Prediction Using Extreme Gradient Boosting

Journal

MOLECULES
Volume 21, Issue 8

Publisher

MDPI
DOI: 10.3390/molecules21080983

Keywords

biological data; drug discovery; virtual screening; prediction of biological activity

Funding

  1. Ministry of Higher Education (MOHE)
  2. Research Management Centre (RMC) at the Universiti Teknologi Malaysia (UTM) under the Research University Grant Category [VOT Q.J130000.2528.14H75]

Following the explosive growth in chemical and biological data, the shift from traditional methods of drug discovery to computer-aided means has made data mining and machine learning methods integral parts of today's drug discovery process. In this paper, extreme gradient boosting (Xgboost), which is an ensemble of Classification and Regression Trees (CART) and a variant of the Gradient Boosting Machine, was investigated for the prediction of biological activity based on a quantitative description of the compound's molecular structure. Seven datasets well known in the literature were used, and the experimental results show that Xgboost can outperform machine learning algorithms like Random Forest (RF), Support Vector Machines (LSVM), Radial Basis Function Neural Network (RBFN) and Naive Bayes (NB) for the prediction of biological activities. In addition to its ability to detect minority activity classes in highly imbalanced datasets, it showed remarkable performance on both high and low diversity datasets.
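
To illustrate the kind of model the abstract describes, the following is a minimal sketch of second-order gradient boosting with depth-1 regression trees (stumps) for a binary active/inactive label. It is not the authors' pipeline and not the Xgboost library itself: the two-descriptor toy data, the function names (fit_stump, boost, predict_proba) and the settings (rounds, eta, lam) are all assumptions chosen for illustration, and real molecular descriptors or fingerprints would have far more features.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_stump(X, grad, hess, lam):
    """Pick the single split with the largest second-order gain.

    Returns (feature, threshold, left_weight, right_weight), where each
    leaf weight is -G / (H + lam) for the gradients G and Hessians H
    that fall into that leaf.
    """
    G, H = sum(grad), sum(hess)
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            GL = sum(g for row, g in zip(X, grad) if row[j] <= t)
            HL = sum(h for row, h in zip(X, hess) if row[j] <= t)
            GR, HR = G - GL, H - HL
            gain = (GL * GL / (HL + lam) + GR * GR / (HR + lam)
                    - G * G / (H + lam))
            if best is None or gain > best[0]:
                best = (gain, j, t, -GL / (HL + lam), -GR / (HR + lam))
    return best[1:]

def boost(X, y, rounds=20, eta=0.3, lam=1.0):
    """Fit an additive ensemble of stumps on the logistic loss."""
    scores = [0.0] * len(X)
    stumps = []
    for _ in range(rounds):
        p = [sigmoid(s) for s in scores]
        grad = [pi - yi for pi, yi in zip(p, y)]   # first derivative
        hess = [pi * (1.0 - pi) for pi in p]       # second derivative
        j, t, wl, wr = fit_stump(X, grad, hess, lam)
        # Store leaf weights already scaled by the learning rate eta.
        stumps.append((j, t, eta * wl, eta * wr))
        scores = [s + (eta * wl if row[j] <= t else eta * wr)
                  for s, row in zip(scores, X)]
    return stumps

def predict_proba(stumps, row):
    """Probability of the active class for one descriptor vector."""
    score = sum(wl if row[j] <= t else wr for j, t, wl, wr in stumps)
    return sigmoid(score)

# Hypothetical, imbalanced toy data: six inactives (0), two actives (1).
X = [[0.1, 1.0], [0.2, 0.8], [0.3, 0.9], [0.4, 1.1],
     [0.5, 0.7], [0.6, 1.2], [0.9, 0.2], [1.0, 0.1]]
y = [0, 0, 0, 0, 0, 0, 1, 1]

model = boost(X, y)
```

Calling predict_proba(model, row) on a new descriptor vector returns a value between 0 and 1 that can be thresholded to flag likely actives. Xgboost itself grows deeper trees and adds further regularization and engineering on top of this idea, so the sketch only conveys the core boosting step.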
