Article

On Position-Specific Scoring Matrix for Protein Function Prediction

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2010.93

Keywords

Clustering, classification, and association rules; data mining; feature extraction or construction; mining methods and algorithms

Funding

  1. US National Science Foundation (NSF) [IIS-0644366]
  2. National Science Foundation, Directorate for Computer & Information Science & Engineering, Division of Information & Intelligent Systems [1347706]

Abstract

While genome sequencing projects have generated tremendous amounts of protein sequence data for a vast number of genomes, substantial portions of most genomes are still unannotated. Although experimental methods for identifying protein functions have been successful, they are often lab-intensive and time-consuming. Thus, in silico methods are the only practical option for genome-wide functional annotation. In this paper, we propose new features extracted from protein sequence alone, together with machine learning-based methods, for computational function prediction. These features are derived from a position-specific scoring matrix, which has shown great potential in other bioinformatics problems. We evaluate these features using four different classifiers and yeast protein data. Our experimental results show that features derived from the position-specific scoring matrix are appropriate for automatic function annotation.
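The abstract does not say how the features are computed from the position-specific scoring matrix (PSSM), so the following is only an illustrative sketch of one simple possibility, not the authors' method. It assumes a PSSM stored as one row of 20 scores per residue, with columns in the order used by PSI-BLAST ASCII output, and collapses it into a fixed-length vector by averaging logistic-scaled scores per column. The function name `pssm_column_means` and the toy matrix are hypothetical.

```python
import math

# Assumed column order (the one PSI-BLAST uses in its ASCII PSSM output).
AMINO_ACIDS = "ARNDCQEGHILKMFPSTWYV"


def sigmoid(x):
    """Map a raw PSSM score into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def pssm_column_means(pssm):
    """Collapse an L x 20 PSSM into 20 features.

    Each feature is the mean of the logistic-scaled scores in one column,
    so proteins of any length map to a vector of the same size.
    This is an illustrative choice, not the feature set from the paper.
    """
    if not pssm:
        raise ValueError("PSSM must contain at least one row")
    n_cols = len(pssm[0])
    if any(len(row) != n_cols for row in pssm):
        raise ValueError("all PSSM rows must have the same number of columns")
    length = len(pssm)
    return [
        sum(sigmoid(row[j]) for row in pssm) / length
        for j in range(n_cols)
    ]


# Toy 3-residue PSSM with made-up scores, for demonstration only.
toy_pssm = [
    [0] * 20,
    [2] + [0] * 19,
    [-2] + [0] * 19,
]
features = pssm_column_means(toy_pssm)

for aa, value in zip(AMINO_ACIDS, features):
    print(f"{aa}: {value:.3f}")
```

A vector like `features` has the same length for every protein, which is what most off-the-shelf classifiers require as input.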
