4.7 Article

Improving Docking Power for Short Peptides Using Random Forest

期刊

JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 61, 期 6, 页码 3074-3090

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.1c00573

关键词

-

资金

  1. National Institute of General Medical Sciences of the National Institutes of Health [R01GM096888, R01GM069832]

向作者/读者索取更多资源

Therapeutic peptides have gained significant interest as drugs, but peptide docking remains challenging. By using random forest classifiers, the docking efficiency of peptides can be greatly improved, paving the way for successful peptide docking rates comparable to those of small molecules.
In recent years, therapeutic peptides have gained a lot interest as demonstrated by the 60 peptides approved as drugs in major markets and 150+ peptides currently in clinical trials. However, while small molecule docking is routinely used in rational drug design efforts, docking peptides has proven challenging partly because docking scoring functions, developed and calibrated for small molecules, perform poorly for these molecules. Here, we present random forest classifiers trained to discriminate correctly docked peptides. We show that, for a testing set of 47 protein-peptide complexes, structurally dissimilar from the training set and previously used to benchmark AutoDock Vina's ability to dock short peptides, these random forest classifiers improve docking power from similar to 25% for AutoDock scoring functions to an average of similar to 70%. These results pave the way for peptide-docking success rates comparable to those of small molecule docking. To develop these classifiers, we compiled the ProptPep37_2021 data set, a curated, high-quality set of 322 crystallographic protein-peptides complexes annotated with structural similarity information. The data set also provides a collection of high-quality putative poses with a range of deviations from the crystallographic pose, providing correct and incorrect poses (i.e., decoys) of the peptide for each entry. The ProptPep37_2021 data set as well as the classifiers presented here are freely available.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据