4.7 Article

A Fast and Interpretable Deep Learning Approach for Accurate Electrostatics-Driven pKa Predictions in Proteins

期刊

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jctc.2c00308

关键词

-

资金

  1. FCT [SFRH/BD/136226/2018, CEECIND/02300/2017, UIDB/04046/2020, UIDP/04046/2020]
  2. European Union [101017567]
  3. Bayer AG Life Science Collaboration (Explainable AI)

向作者/读者索取更多资源

In this study, deep learning models trained on a dataset of 6 million theoretically determined pK(a) shifts successfully inferred the electrostatic contributions of different chemical groups and the importance of solvent exposure. The models demonstrated the best accuracy in a test set and significantly outperformed physics-based methods in terms of inference speed.
Existing computational methods for estimating pK(a) values in proteins rely on theoretical approximations and lengthy computations. In this work, we use a data set of 6 million theoretically determined pK(a) shifts to train deep learning models, which are shown to rival the physics-based predictors. These neural networks managed to infer the electrostatic contributions of different chemical groups and learned the importance of solvent exposure and close interactions, including hydrogen bonds. Although trained only using theoretical data, our pKAI+ model displayed the best accuracy in a test set of similar to 750 experimental values. Inference times allow speedups of more than 1000x compared to physics-based methods. By combining speed, accuracy, and a reasonable understanding of the underlying physics, our models provide a game-changing solution for fast estimations of macroscopic pK(a) values from ensembles of microscopic values as well as for many downstream applications such as molecular docking and constant-pH molecular dynamics simulations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据