4.3 Article

Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS
Volume 90, Issue 1, Pages 58-72

Publisher

WILEY
DOI: 10.1002/prot.26186

Keywords

inter-residue distance prediction; protein quality assessment; protein structure prediction

Funding

  1. NSF [DBI 1759934, IIS1763246]
  2. NIH [R01GM093123]
  3. Department of Energy [DE-SC0021303, DE-SC0020400, DE-AC05-00OR22725]

Ask authors/readers for more resources

Significant progress has been made in protein structure prediction by utilizing deep learning and residue-residue distance prediction since CASP13. The MULTICOM predictor in the 2020 CASP14 experiment ranked well in both tertiary structure prediction and inter-domain structure prediction, showing improvement in template-free modeling and overall performance.
Substantial progresses in protein structure prediction have been made by utilizing deep-learning and residue-residue distance prediction since CASP13. Inspired by the advances, we improve our CASP14 MULTICOM protein structure prediction system by incorporating three new components: (a) a new deep learning-based protein inter-residue distance predictor to improve template-free (ab initio) tertiary structure prediction, (b) an enhanced template-based tertiary structure prediction method, and (c) distance-based model quality assessment methods empowered by deep learning. In the 2020 CASP14 experiment, MULTICOM predictor was ranked seventh out of 146 predictors in tertiary structure prediction and ranked third out of 136 predictors in inter-domain structure prediction. The results demonstrate that the template-free modeling based on deep learning and residue-residue distance prediction can predict the correct topology for almost all template-based modeling targets and a majority of hard targets (template-free targets or targets whose templates cannot be recognized), which is a significant improvement over the CASP13 MULTICOM predictor. Moreover, the template-free modeling performs better than the template-based modeling on not only hard targets but also the targets that have homologous templates. The performance of the template-free modeling largely depends on the accuracy of distance prediction closely related to the quality of multiple sequence alignments. The structural model quality assessment works well on targets for which enough good models can be predicted, but it may perform poorly when only a few good models are predicted for a hard target and the distribution of model quality scores is highly skewed. MULTICOM is available at and .

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available