4.2 Article

High-Performance Prediction of Functional Residues in Proteins with Machine Learning and Computed Input Features

期刊

BIOPOLYMERS
卷 95, 期 6, 页码 390-400

出版社

WILEY-BLACKWELL
DOI: 10.1002/bip.21589

关键词

protein function; functional residues; THEMATICS; POOL

资金

  1. National Science Foundation [MCB-0843603]
  2. Div Of Molecular and Cellular Bioscience
  3. Direct For Biological Sciences [0843603] Funding Source: National Science Foundation

向作者/读者索取更多资源

One of the major challenges in genomics is to understand the function of gene products from their 3D structures. Computational methods are needed for the high-throughput prediction of the function of proteins from their 3D structure. Methods that identify active sites are important for understanding and annotating the function of proteins. Traditional methods exploiting either sequence similarity or structural similarity can be unreliable and cannot be applied to proteins with novel folds or low homology with other proteins. Here, we present a machine-learning application that combines computed electrostatic, evolutionary, and pocket geometric information for high-performance prediction of catalytic residues. Input features consist of our structure-based theoretical microscopic anomalous titration curve shapes (THEMATICS) electrostatics data, enhanced with sequence-based phylogenetic information from INTREPID and topological pocket information from Con Cavity. Our THEMATICS-based input features are augmented with an additional metric, the theoretical buffer range. With the integration of the three different types of input, each of which performs admirably on its own, significantly better performance is achieved than that of any of these methods by itself This combined method achieves 86.7%, 92.5%, and 93.8% recall of annotated functional residues at 5, 8, and 10% false-positive rates, respectively. (C) 2011 Wiley Periodicals, Inc. Biopolymers 95: 390-400, 2011.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据