☆ 4.6 Article

Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery

JOURNAL OF CHEMINFORMATICS (2019)

期刊

JOURNAL OF CHEMINFORMATICS

卷 11, 期 -, 页码 -

出版社

BMC

DOI: 10.1186/s13321-019-0368-1

关键词

Deep neural networks; Machine learning; Compound-protein interaction; Proteochemometrics; Cheminformatics

类别

Chemistry, Multidisciplinary Computer Science, Information Systems Computer Science, Interdisciplinary Applications

资金

Institute for Information and Communications Technology Promotion (IITP) - Korea Government (MSIP) [2017-0-00398]
National Research Foundation of Korea (NRF) - Ministry of Science, ICT and future Planning [NRF-2017R1A2B2008729]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Analysis of compound-protein interactions (CPIs) has become a crucial prerequisite for drug discovery and drug repositioning. In vitro experiments are commonly used in identifying CPIs, but it is not feasible to discover the molecular and proteomic space only through experimental approaches. Machine learning's advances in predicting CPIs have made significant contributions to drug discovery. Deep neural networks (DNNs), which have recently been applied to predict CPIs, performed better than other shallow classifiers. However, such techniques commonly require a considerable volume of dense data for each training target. Although the number of publicly available CPI data has grown rapidly, public data is still sparse and has a large number of measurement errors. In this paper, we propose a novel method, Multi-channel PINN, to fully utilize sparse data in terms of representation learning. With representation learning, Multi-channel PINN can utilize three approaches of DNNs which are a classifier, a feature extractor, and an end-to-end learner. Multi-channel PINN can be fed with both low and high levels of representations and incorporates each of them by utilizing all approaches within a single model. To fully utilize sparse public data, we additionally explore the potential of transferring representations from training tasks to test tasks. As a proof of concept, Multi-channel PINN was evaluated on fifteen combinations of feature pairs to investigate how they affect the performance in terms of highest performance, initial performance, and convergence speed. The experimental results obtained indicate that the multi-channel models using protein features performed better than single-channel models or multi-channel models using compound features. Therefore, Multi-channel PINN can be advantageous when used with appropriate representations. Additionally, we pretrained models on a training task then finetuned them on a test task to figure out whether Multi-channel PINN can capture general representations for compounds and proteins. We found that there were significant differences in performance between pretrained models and non-pretrained models.

Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery

期刊

JOURNAL OF CHEMINFORMATICS

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery

期刊

JOURNAL OF CHEMINFORMATICS

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文