4.7 Article

WDL-RF: predicting bioactivities of ligand molecules acting with G protein-coupled receptors by combining weighted deep learning and random forest

期刊

BIOINFORMATICS
卷 34, 期 13, 页码 2271-2282

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bty070

关键词

-

资金

  1. National Science Foundation of China [81771478, 61571233]
  2. key University Science Research Project of Jiangsu Province [17KJA510003]
  3. National Science Foundation [DBI1564756]
  4. NATIONAL CANCER INSTITUTE [T32CA140044] Funding Source: NIH RePORTER
  5. NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM083107, R01GM116960] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Motivation: Precise assessment of ligand bioactivities (including IC50, EC50, K-i, K-d, etc.) is essential for virtual screening and lead compound identification. However, not all ligands have experimentally determined activities. In particular, many G protein-coupled receptors (GPCRs), which are the largest integral membrane protein family and represent targets of nearly 40% drugs on the market, lack published experimental data about ligand interactions. Computational methods with the ability to accurately predict the bioactivity of ligands can help efficiently address this problem. Results: We proposed a new method, WDL-RF, using weighted deep learning and random forest, to model the bioactivity of GPCR-associated ligand molecules. The pipeline of our algorithm consists of two consecutive stages: (i) molecular fingerprint generation through a new weighted deep learning method, and (ii) bioactivity calculations with a random forest model; where one uniqueness of the approach is that the model allows end-to-end learning of prediction pipelines with input ligands being of arbitrary size. The method was tested on a set of twenty-six non-redundant GPCRs that have a high number of active ligands, each with 200-4000 ligand associations. The results from our benchmark show that WDL-RF can generate bioactivity predictions with an average root-mean square error 1.33 and correlation coefficient (r(2)) 0.80 compared to the experimental measurements, which are significantly more accurate than the control predictors with different molecular fingerprints and descriptors. In particular, data-driven molecular fingerprint features, as extracted from the weighted deep learning models, can help solve deficiencies stemming from the use of traditional hand-crafted features and significantly increase the efficiency of short molecular finger-prints in virtual screening.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据