Journal
JOURNAL OF CHEMICAL INFORMATION AND MODELING
Volume -, Issue -, Pages -
Publisher
AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.2c00695
Keywords
-
Category
Funding
- CAPES [T32GM067547]
- Chan Zuckerberg Initiative DAF, an advised fund of the Silicon Valley Community Foundation
- Pfizer
- CTSI TL1 Postdoctoral Fellowship
- NIH [23038.004007/2014-82]
- FAPEMIG through a CAPES
- UCSF Graduate Division
- CAPES [310197/2021-0]
- CNPq
- CAPES
- CNPq [T32GM067547]
- FAPEMIG [T32GM067547, 23038.004007/2014-82]
- [312143/2020-6]
- [APQ-01834-21]
This paper introduces LUNA, a toolkit for machine learning-based drug discovery that calculates protein-ligand interactions and encodes them into new fingerprints. LUNA also provides visual strategies that make the fingerprints interpretable. In the reported experiments, these fingerprints reproduce DOCK3.7 docking scores better than related molecular and interaction fingerprints and identify similarities across molecular complexes that other fingerprints overlook. The authors therefore present LUNA and its interface fingerprints as promising approaches for machine learning-based drug discovery.
The success of machine learning-based drug discovery depends on molecular representation. Yet traditional molecular fingerprints omit both the protein and pointers back to structural information that would enable better model interpretability. Therefore, we propose LUNA, a Python 3 toolkit that calculates and encodes protein-ligand interactions into new hashed fingerprints inspired by the Extended Connectivity FingerPrint (ECFP): the Extended Interaction FingerPrint (EIFP), the Functional Interaction FingerPrint (FIFP), and the Hybrid Interaction FingerPrint (HIFP). LUNA also provides visual strategies to make the fingerprints interpretable. We performed three major experiments exploring the fingerprints' use. First, we trained machine learning models to reproduce DOCK3.7 scores using 1 million docked Dopamine D4 complexes. We found that EIFP-4,096 (R² = 0.61) outperformed related molecular and interaction fingerprints. Second, we used LUNA to support interpretable machine learning models. Finally, we demonstrate that interaction fingerprints can accurately identify similarities across molecular complexes that other fingerprints overlook. Hence, we envision LUNA and its interface fingerprints as promising methods for machine learning-based virtual screening campaigns. LUNA is freely available at https://github.com/keiserlab/LUNA.
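To illustrate the general idea of a hashed interaction fingerprint, the sketch below hashes a set of interaction descriptors into a fixed-length bit vector and compares two complexes by Tanimoto similarity. It is a minimal illustration under stated assumptions, not LUNA's implementation or API: the descriptor strings, the helper names `fold_features` and `tanimoto`, and the use of SHA-256 as the hash are all hypothetical choices made for this example, and the ECFP-style iterative neighborhood expansion is omitted. Only the 4,096-bit length echoes the EIFP-4,096 configuration mentioned above.

```python
import hashlib


def fold_features(features, n_bits=4096):
    """Map each feature string to one bit position in an n_bits-long fingerprint.

    Returns the set of "on" bit indices. Different features can collide on the
    same bit, which is the usual trade-off of hashed (folded) fingerprints.
    """
    on_bits = set()
    for feature in features:
        # SHA-256 is used only because it is deterministic across runs
        # (an assumption of this sketch, not the hash used by LUNA).
        digest = hashlib.sha256(feature.encode("utf-8")).digest()
        on_bits.add(int.from_bytes(digest[:8], "big") % n_bits)
    return on_bits


def tanimoto(bits_a, bits_b):
    """Tanimoto similarity between two sets of on-bit indices."""
    if not bits_a and not bits_b:
        return 0.0
    return len(bits_a & bits_b) / len(bits_a | bits_b)


# Hypothetical interaction descriptors for two protein-ligand complexes.
complex_a = [
    "hydrogen_bond:donor-acceptor",
    "hydrophobic:aromatic-aliphatic",
    "ionic:cation-anion",
]
complex_b = [
    "hydrogen_bond:donor-acceptor",
    "hydrophobic:aromatic-aliphatic",
    "pi_stacking:aromatic-aromatic",
]

fp_a = fold_features(complex_a)
fp_b = fold_features(complex_b)
similarity = tanimoto(fp_a, fp_b)
print(f"Tanimoto similarity: {similarity:.2f}")
```

Because each on bit can be traced back to the descriptors that set it, a hashed fingerprint of this kind can keep a pointer from bits to the underlying interactions, which is the property the abstract highlights for interpretability.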