☆ 4.3 Article

Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2022)

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

Volume 90, Issue 1, Pages 58-72

Publisher

WILEY

DOI: 10.1002/prot.26186

Keywords

inter-residue distance prediction; protein quality assessment; protein structure prediction

Funding

NSF [DBI 1759934, IIS1763246]
NIH [R01GM093123]
Department of Energy [DE-SC0021303, DE-SC0020400, DE-AC05-00OR22725]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Significant progress has been made in protein structure prediction by utilizing deep learning and residue-residue distance prediction since CASP13. The MULTICOM predictor in the 2020 CASP14 experiment ranked well in both tertiary structure prediction and inter-domain structure prediction, showing improvement in template-free modeling and overall performance.

Substantial progresses in protein structure prediction have been made by utilizing deep-learning and residue-residue distance prediction since CASP13. Inspired by the advances, we improve our CASP14 MULTICOM protein structure prediction system by incorporating three new components: (a) a new deep learning-based protein inter-residue distance predictor to improve template-free (ab initio) tertiary structure prediction, (b) an enhanced template-based tertiary structure prediction method, and (c) distance-based model quality assessment methods empowered by deep learning. In the 2020 CASP14 experiment, MULTICOM predictor was ranked seventh out of 146 predictors in tertiary structure prediction and ranked third out of 136 predictors in inter-domain structure prediction. The results demonstrate that the template-free modeling based on deep learning and residue-residue distance prediction can predict the correct topology for almost all template-based modeling targets and a majority of hard targets (template-free targets or targets whose templates cannot be recognized), which is a significant improvement over the CASP13 MULTICOM predictor. Moreover, the template-free modeling performs better than the template-based modeling on not only hard targets but also the targets that have homologous templates. The performance of the template-free modeling largely depends on the accuracy of distance prediction closely related to the quality of multiple sequence alignments. The structural model quality assessment works well on targets for which enough good models can be predicted, but it may perform poorly when only a few good models are predicted for a hard target and the distribution of model quality scores is highly skewed. MULTICOM is available at and .

Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper