☆ 4.3 Article

Deep-learning contact-map guided protein structure prediction in CASP13

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2019)

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

Volume 87, Issue 12, Pages 1149-1164

Publisher

WILEY

DOI: 10.1002/prot.25792

Keywords

ab initio folding; CASP13; contact prediction; deep convolutional neural networks; deep multiple sequence alignment; protein structure prediction

Funding

Division of Biological Infrastructure [DBI1564756]
National Institute of Allergy and Infectious Diseases [AI134678]
National Institute of General Medical Sciences [GM083107, GM116960]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

We report the results of two fully automated structure prediction pipelines, Zhang-Server and QUARK, in CASP13. The pipelines were built upon the C-I-TASSER and C-QUARK programs, which in turn are based on I-TASSER and QUARK but with three new modules: (a) a novel multiple sequence alignment (MSA) generation protocol to construct deep sequence-profiles for contact prediction; (b) an improved meta-method, NeBcon, which combines multiple contact predictors, including ResPRE that predicts contact-maps by coupling precision-matrices with deep residual convolutional neural-networks; and (c) an optimized contact potential to guide structure assembly simulations. For 50 CASP13 FM domains that lacked homologous templates, average TM-scores of the first models produced by C-I-TASSER and C-QUARK were 28% and 56% higher than those constructed by I-TASSER and QUARK, respectively. For the first time, contact-map predictions demonstrated usefulness on TBM domains with close homologous templates, where TM-scores of C-I-TASSER models were significantly higher than those of I-TASSER models with a P-value <.05. Detailed data analyses showed that the success of C-I-TASSER and C-QUARK was mainly due to the increased accuracy of deep-learning-based contact-maps, as well as the careful balance between sequence-based contact restraints, threading templates, and generic knowledge-based potentials. Nevertheless, challenges still remain for predicting quaternary structure of multi-domain proteins, due to the difficulties in domain partitioning and domain reassembly. In addition, contact prediction in terminal regions was often unsatisfactory due to the sparsity of MSAs. Development of new contact-based domain partitioning and assembly methods and training contact models on sparse MSAs may help address these issues.

Deep-learning contact-map guided protein structure prediction in CASP13

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Deep-learning contact-map guided protein structure prediction in CASP13

Journal

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper