4.6 Article

Random-forest model for drug-target interaction prediction via Kullbeck-Leibler divergence

期刊

JOURNAL OF CHEMINFORMATICS
卷 14, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s13321-022-00644-1

关键词

Chemocentric; 3D Molecular Fingerprint; 3D Similarity; Drug-Target Interaction Feature; Nonparametric Density Estimation; Kullbeck-Leibler Divergence; Machine Learning

资金

  1. Basic Science Research Program of the National Research Foundation of Korea (NRF) - Ministry of Education, Science and Technology [:2017R1E1A1A01076642]

向作者/读者索取更多资源

Virtual screening has improved early stage drug discovery success rates, with recent methods benefiting from advances in machine learning and chemical information. Creative extraction of drug features is crucial for predicting drug-target interactions, and the development of a feature-driven classification model has shown promise in this area.
Virtual screening has significantly improved the success rate of early stage drug discovery. Recent virtual screening methods have improved owing to advances in machine learning and chemical information. Among these advances, the creative extraction of drug features is important for predicting drug-target interaction (DTI), which is a large-scale virtual screening of known drugs. Herein, we report Kullbeck-Leibler divergence (KLD) as a DTI feature and the feature-driven classification model applicable to DTI prediction. For the purpose, E3FP three-dimensional (3D) molecular fingerprints of drugs as a molecular representation allow the computation of 3D similarities between ligands within each target (Q-Q matrix) to identify the uniqueness of pharmacological targets and those between a query and a ligand (Q-L vector) in DTIs. The 3D similarity matrices are transformed into probability density functions via kernel density estimation as a nonparametric estimation. Each density model can exploit the characteristics of each pharmacological target and measure the quasi-distance between the ligands. Furthermore, we developed a random forest model from the KLD feature vectors to successfully predict DTIs for representative 17 targets (mean accuracy: 0.882, out-of-bag score estimate: 0.876, ROC AUC: 0.990). The method is applicable for 2D chemical similarity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据