Article

An accurate and interpretable deep learning model for environmental properties prediction using hybrid molecular representations

AICHE JOURNAL (2022)

Journal

AICHE JOURNAL

Volume 68, Issue 6, Pages -

Publisher

WILEY

DOI: 10.1002/aic.17634

Keywords

deep learning network; interpretability; lipophilicity; message-passing neural network; QSPR

Category

Engineering, Chemical

Funding

Chongqing Joint Chinese Medicine Scientific Research Project [2020ZY023984]
National Natural Science Foundation of China [21878028]
Research Foundation of Chongqing University of Science and Technology [ckrc2019006]
National Natural Science Foundation for Excellent Young Scientists of China [22122802]

Summary

This study developed an accurate and interpretable deep neural network (AI-DNN) model for predicting lipophilicity. A hybrid molecular representation, combining representations learned by directed message passing neural networks with fixed molecule-level features, was used to capture both the local and the global features of molecules. The proposed model showed promising predictive accuracy and discriminative power for structural isomers and stereoisomers. Monte Carlo Tree Search was used to interpret the model, which matters in fields with a high demand for interpretable deep networks, such as green solvent design and drug discovery.

Abstract

Lipophilicity, as quantified by the decimal logarithm of the octanol-water partition coefficient (log K_OW), is an essential environmental property. Quantitative structure-property relationship (QSPR) studies based on deep neural networks (DNNs) have received increasing attention because of their excellent predictive performance. However, the black-box nature of DNNs limits their use in applications where interpretability is essential. Hence, this study aims to develop an accurate and interpretable deep neural network (AI-DNN) model for log K_OW prediction. A hybrid molecular representation was employed to ensure the accuracy of the proposed AI-DNN model. This hybrid representation integrates the molecular representations learned by directed message passing neural networks (D-MPNNs) with the fixed molecule-level features of CDK descriptors, and can capture both the local and the global features of the overall molecule. The performance analysis shows that the proposed QSPR model exhibits promising predictive accuracy and discriminative power for structural isomers and stereoisomers. Moreover, the Monte Carlo Tree Search (MCTS) approach was used to interpret the proposed AI-DNN model by identifying the molecular substructures that contribute to lipophilicity. This interpretability can be applied in critical fields where there is a high demand for interpretable deep networks, such as green solvent design and drug discovery.
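The two ideas the abstract relies on, the definition of log K_OW and the hybrid representation, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the placeholder vectors, and the single linear readout are assumptions made for illustration, and the D-MPNN encoder and CDK descriptor calculation are not implemented here (their outputs are stood in for by plain lists of numbers).

```python
import math


def log_kow(c_octanol, c_water):
    """Decimal logarithm of the octanol-water partition coefficient,
    given equilibrium concentrations in the two phases (same units)."""
    return math.log10(c_octanol / c_water)


def hybrid_representation(learned_vec, fixed_descriptors):
    """Concatenate a learned molecule embedding (stand-in for a D-MPNN
    output) with fixed molecule-level descriptors (stand-in for CDK
    descriptors) into one feature vector."""
    return list(learned_vec) + list(fixed_descriptors)


def linear_readout(features, weights, bias):
    """Hypothetical final layer: a single linear map from the hybrid
    feature vector to a scalar prediction."""
    if len(features) != len(weights):
        raise ValueError("features and weights must have the same length")
    return sum(f * w for f, w in zip(features, weights)) + bias


# Illustrative values only; none of these numbers come from the paper.
learned = [0.1, 0.2]      # placeholder learned embedding
descriptors = [3.0]       # placeholder fixed descriptor
features = hybrid_representation(learned, descriptors)
prediction = linear_readout(features, [0.5, 0.25, 0.1], 0.0)
```

In this sketch, concatenation is the only step that makes the representation "hybrid": the downstream layer sees local information (through the learned embedding) and global information (through the fixed descriptors) in one vector.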
