4.7 Article

Characterization and prediction of residues determining protein functional specificity

Journal

BIOINFORMATICS
Volume 24, Issue 13, Pages 1473-1480

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btn214

Keywords

-

Funding

  1. NCI NIH HHS [P01 CA041086, CA041086] Funding Source: Medline
  2. NHGRI NIH HHS [T32 HG003284] Funding Source: Medline
  3. NIGMS NIH HHS [R01 GM076275, GM076275, P50 GM071508] Funding Source: Medline

Ask authors/readers for more resources

Motivation: Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each proteins particular function-al specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally determined SDPs. Results: We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a dataset of SDPs. The resulting large dataset, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolution-ary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, GroupSim, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe ConsWin, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large dataset of enzyme SDPs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available