4.7 Article

True Accuracy of Fast Scoring Functions to Predict High-Throughput Screening Data from Docking Poses: The Simpler the Better

期刊

JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 61, 期 6, 页码 2788-2797

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.1c00292

关键词

-

资金

  1. Doctoral School of Chemical Sciences (EDSC, University of Strasbourg)

向作者/读者索取更多资源

This study conducted an unbiased evaluation of four scoring functions on a high-confidence experimental screening dataset, revealing that rescoring based on simple interaction fingerprints or interaction graphs outperforms advanced machine learning and deep learning scoring functions in most cases. It also highlights the tendency of deep learning methods to predict affinity values within a narrow range centered on the mean value of training samples, and suggests the importance of pre-existing binding modes in detecting the most potent binders.
Hundreds of fast scoring functions have been developed over the last 20 years to predict binding free energies from three-dimensional structures of protein-ligand complexes. Despite numerous statistical promises, we believe that none of them has been properly validated for daily prospective high-throughput virtual screening studies, mostly because in silico screening challenges usually employ artificially built and biased datasets. We here carry out a fully unbiased evaluation of four scoring functions (Pafnucy, Delta vinaRF20, IFP, and GRIM) on an in-house developed data collection of experimental high-confidence screening data (LIT-PCBA) covering about 3 million data points on 15 diverse pharmaceutical targets. All four scoring functions were applied to rescore the docking poses of LIT-PCBA compounds in conditions mimicking exactly standard drug discovery scenarios and were compared in terms of propensity to enrich true binders in the top 1%-ranked hit lists. Interestingly, rescoring based on simple interaction fingerprints or interaction graphs outperforms state-of-the-art machine learning and deep learning scoring functions in most of the cases. The current study notably highlights the strong tendency of deep learning methods to predict affinity values within a very narrow range centered on the mean value of samples used for training. Moreover, it suggests that knowledge of pre-existing binding modes is the key to detecting the most potent binders.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据