4.7 Article

Complementarity of the residue-level protein function and structure predictions in human proteins

Journal

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL
Volume 20, Issue -, Pages 2223-2234

Publisher

ELSEVIER
DOI: 10.1016/j.csbj.2022.05.003

Keywords

Protein structure prediction; Protein function prediction; Intrinsic disorder; Secondary structure; Solvent accessibility; Nucleic acid binding; Evaluation Meta-prediction; Webserver

Funding

  1. National Science Foundation [2125218, 2146027]
  2. Robert J. Mattauch Endowment funds
  3. Tempus Public Foundation [CM-SMP-KA107/466238/2020]
  4. Direct For Computer & Info Scie & Enginr
  5. Div Of Information & Intelligent Systems [2125218] Funding Source: National Science Foundation
  6. Div Of Biological Infrastructure
  7. Direct For Biological Sciences [2146027] Funding Source: National Science Foundation

Ask authors/readers for more resources

This study explores the complementarity of different prediction methods, finding that combining structure-trained and disorder-trained predictions leads to improved quality, and demonstrating that predictions accurately reflect various relationships in experimental data.
Sequence-based predictors of the residue-level protein function and structure cover a broad spectrum of characteristics including intrinsic disorder, secondary structure, solvent accessibility and binding to nucleic acids. They were catalogued and evaluated in numerous surveys and assessments. However, methods focusing on a given characteristic are studied separately from predictors of other characteristics, while they are typically used on the same proteins. We fill this void by studying complementarity of a representative collection of methods that target different predictions using a large, taxonomically consistent, and low similarity dataset of human proteins. First, we bridge the gap between the communities that develop structure-trained vs. disorder-trained predictors of binding residues. Motivated by a recent study of the protein-binding residue predictions, we empirically find that combining the structuretrained and disorder-trained predictors of the DNA-binding and RNA-binding residues leads to substantial improvements in predictive quality. Second, we investigate whether diverse predictors generate results that accurately reproduce relations between secondary structure, solvent accessibility, interaction sites, and intrinsic disorder that are present in the experimental data. Our empirical analysis concludes that predictions accurately reflect all combinations of these relations. Altogether, this study provides unique insights that support combining results produced by diverse residue-level predictors of protein function and structure. (c) 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY license (http://creativecommons. org/licenses/by/4.0/).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available