4.7 Article

Complementarity of the residue-level protein function and structure predictions in human proteins

期刊

出版社

ELSEVIER
DOI: 10.1016/j.csbj.2022.05.003

关键词

Protein structure prediction; Protein function prediction; Intrinsic disorder; Secondary structure; Solvent accessibility; Nucleic acid binding; Evaluation Meta-prediction; Webserver

资金

  1. National Science Foundation [2125218, 2146027]
  2. Robert J. Mattauch Endowment funds
  3. Tempus Public Foundation [CM-SMP-KA107/466238/2020]
  4. Direct For Computer & Info Scie & Enginr
  5. Div Of Information & Intelligent Systems [2125218] Funding Source: National Science Foundation
  6. Div Of Biological Infrastructure
  7. Direct For Biological Sciences [2146027] Funding Source: National Science Foundation

向作者/读者索取更多资源

This study explores the complementarity of different prediction methods, finding that combining structure-trained and disorder-trained predictions leads to improved quality, and demonstrating that predictions accurately reflect various relationships in experimental data.
Sequence-based predictors of the residue-level protein function and structure cover a broad spectrum of characteristics including intrinsic disorder, secondary structure, solvent accessibility and binding to nucleic acids. They were catalogued and evaluated in numerous surveys and assessments. However, methods focusing on a given characteristic are studied separately from predictors of other characteristics, while they are typically used on the same proteins. We fill this void by studying complementarity of a representative collection of methods that target different predictions using a large, taxonomically consistent, and low similarity dataset of human proteins. First, we bridge the gap between the communities that develop structure-trained vs. disorder-trained predictors of binding residues. Motivated by a recent study of the protein-binding residue predictions, we empirically find that combining the structuretrained and disorder-trained predictors of the DNA-binding and RNA-binding residues leads to substantial improvements in predictive quality. Second, we investigate whether diverse predictors generate results that accurately reproduce relations between secondary structure, solvent accessibility, interaction sites, and intrinsic disorder that are present in the experimental data. Our empirical analysis concludes that predictions accurately reflect all combinations of these relations. Altogether, this study provides unique insights that support combining results produced by diverse residue-level predictors of protein function and structure. (c) 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY license (http://creativecommons. org/licenses/by/4.0/).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据