4.7 Article

Improved cluster ranking in protein-protein docking using a regression approach

Journal

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL
Volume 19, Issue -, Pages 2269-2278

Publisher

ELSEVIER
DOI: 10.1016/j.csbj.2021.04.028

Keywords

Protein docking; Machine learning; Ranking

Funding

  1. NSF [DMS-1664644, CNS-1645681, DBI 1759277, AF 1645512, IIS-1914792]
  2. ONR [N00014-19-1-2571]
  3. NIGMS [R21GM127952, RM1135136]
  4. NIH [R01 GM135930, R35 GM118078, UL54 TR004130]
  5. DOE [DE-AR-0001282]

Ask authors/readers for more resources

The RRPCC method ranks clusters of protein complex conformations based on pairwise comparisons, leading to improved accuracy in ranking acceptable or better quality clusters. Results suggest that internal energy terms play a crucial role in enhancing scoring quality.
We develop a Regression-based Ranking by Pairwise Cluster Comparisons (RRPCC) method to rank clusters of similar protein complex conformations generated by an underlying docking program. The method leverages robust regression to predict the relative quality difference between any pair or clusters and combines these pairwise assessments to form a ranked list of clusters, from higher to lower quality. We apply RRPCC to clusters produced by the automated docking server ClusPro and, depending on the training/validation strategy, we show improvement by 24-100% in ranking acceptable or better quality clusters first, and by 15-100% in ranking medium or better quality clusters first. We compare the RRPCC-ClusPro combination to a number of alternatives, and show that very different machine learning approaches to scoring docked structures yield similar success rates. Finally, we discuss the current limitations on sampling and scoring, looking ahead to further improvements. Interestingly, some features important for improved scoring are internal energy terms that occur only due to the local energy minimization applied in the refinement stage following rigid body docking. (C) 2021 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available