Article

TopScore: Using Deep Neural Networks and Large Diverse Data Sets for Accurate Protein Model Quality Assessment

Journal

JOURNAL OF CHEMICAL THEORY AND COMPUTATION
Volume 14, Issue 11, Pages 6117-6126

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.jctc.8b00690

Funding

  1. CLIB2021 Graduate Cluster Industrial Biotechnology
  2. German Research Foundation (DFG) within the Collaborative Research Center [SFB 1208, TP A03]

The value of protein models obtained with automated protein structure prediction depends primarily on their accuracy. Protein model quality assessment is thus critical for selecting, from an ensemble of predictions, the model that can best answer biologically relevant questions. However, despite many advances in the field, different methods capture different types of errors, raising the question of which method to use. We introduce TopScore, a meta Model Quality Assessment Program (meta-MQAP) that uses deep neural networks to combine scores from 15 different primary predictors into accurate residue-wise and whole-protein error estimates. The predictions on six large independent data sets are highly correlated with superposition-independent errors in the model, achieving Pearson's R²_all values of 0.93 and 0.78 for whole-protein and residue-wise error predictions, respectively. This is a significant improvement over any of the investigated primary MQAPs, demonstrating that much can be gained by optimally combining different methods and using different and very large data sets.
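
The idea of a meta-predictor evaluated by a squared Pearson correlation can be illustrated with a minimal sketch. This is not the TopScore implementation: a least-squares linear combiner stands in for the deep neural networks, the data are synthetic, and all names (`pearson_r_squared`, `fit_linear_combiner`, `combine`) as well as the noise level and train/test split are assumptions made for illustration only.

```python
import numpy as np


def pearson_r_squared(predicted, observed):
    """Squared Pearson correlation between two 1-D arrays."""
    predicted = np.asarray(predicted, dtype=float)
    observed = np.asarray(observed, dtype=float)
    r = np.corrcoef(predicted, observed)[0, 1]
    return r ** 2


def _with_bias(scores):
    """Append a constant column so the combiner can learn an offset."""
    return np.hstack([scores, np.ones((scores.shape[0], 1))])


def fit_linear_combiner(scores, true_error):
    """Least-squares weights mapping primary scores to the true error.

    Assumption: a linear model replaces the deep neural network here.
    """
    weights, *_ = np.linalg.lstsq(_with_bias(scores), true_error, rcond=None)
    return weights


def combine(scores, weights):
    """Apply fitted weights to a matrix of primary predictor scores."""
    return _with_bias(scores) @ weights


# Synthetic stand-in data (assumption): 15 hypothetical primary predictors,
# each observing the true model error plus its own independent noise.
rng = np.random.default_rng(0)
n_models, n_predictors = 500, 15
true_error = rng.uniform(0.0, 1.0, size=n_models)
scores = true_error[:, None] + rng.normal(0.0, 0.3, size=(n_models, n_predictors))

# Fit on one part of the data, evaluate on the held-out rest.
train, test = slice(0, 400), slice(400, None)
weights = fit_linear_combiner(scores[train], true_error[train])
combined = combine(scores[test], weights)

best_single_r2 = max(
    pearson_r_squared(scores[test, j], true_error[test])
    for j in range(n_predictors)
)
combined_r2 = pearson_r_squared(combined, true_error[test])

print(f"best single predictor R^2: {best_single_r2:.2f}")
print(f"combined predictor R^2:    {combined_r2:.2f}")
```

On this toy data the combined score correlates better with the true error than any single noisy predictor does, which mirrors the qualitative claim of the abstract. The numbers it prints say nothing about the real performance of TopScore or of any actual MQAP.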
