Journal
PROTEIN AND PEPTIDE LETTERS
Volume 17, Issue 4, Pages 464-472
Publisher
BENTHAM SCIENCE PUBL LTD
DOI: 10.2174/092986610790963654
Keywords
Subcellular location of proteins; Minimum Redundancy Maximum Relevance; Feature Selection; Nearest Neighbor Algorithm; Jackknife cross-validation test
Funding
- CAS [KSCX2-YW-R-112]
- Shanghai Leading Academic Discipline Project [J50101]
In this paper, we propose a strategy to predict the subcellular locations of proteins by combining various feature selection methods. First, proteins are encoded by amino-acid composition and physicochemical properties; these features are then ranked by the Minimum Redundancy Maximum Relevance method and further filtered by a feature selection procedure. The Nearest Neighbor Algorithm is used as the prediction model and achieves a correct prediction rate of 70.63%, as evaluated by the jackknife cross-validation test. The feature selection results also enable us to identify the most important protein properties. The prediction software is publicly available at http://chemdata.shu.edu.cn/sub22/ and may play an important complementary role to the series of web-server predictors summarized recently in a review by Chou and Shen (Chou, K.C., Shen, H.B. Natural Science, 2009, 2, 63-92, http://www.scirp.org/journal/NS/).
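The encoding, nearest-neighbor prediction, and jackknife evaluation steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the Euclidean distance, and the toy sequences and labels are all assumptions made for illustration, the physicochemical features are left out, and the Minimum Redundancy Maximum Relevance ranking step is omitted entirely.

```python
from collections import Counter
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def aa_composition(seq):
    """Return the 20-dimensional amino-acid composition of a sequence."""
    counts = Counter(seq)
    total = sum(counts[a] for a in AMINO_ACIDS)
    return [counts[a] / total if total else 0.0 for a in AMINO_ACIDS]


def nearest_neighbor(query, train_x, train_y):
    """Return the label of the training vector closest to the query.

    Euclidean distance is an assumption; the abstract does not state
    which distance measure the paper uses.
    """
    best = min(range(len(train_x)), key=lambda i: math.dist(query, train_x[i]))
    return train_y[best]


def jackknife_accuracy(features, labels):
    """Jackknife (leave-one-out) test: predict each sample from all others."""
    correct = 0
    for i in range(len(features)):
        rest_x = features[:i] + features[i + 1:]
        rest_y = labels[:i] + labels[i + 1:]
        if nearest_neighbor(features[i], rest_x, rest_y) == labels[i]:
            correct += 1
    return correct / len(features)


# Hypothetical toy data, invented for this sketch (not real proteins).
toy = [
    ("AAAAGGGG", "cytoplasm"),
    ("AAAGGGGA", "cytoplasm"),
    ("KKKKRRRR", "nucleus"),
    ("KKRRRRKK", "nucleus"),
]
features = [aa_composition(seq) for seq, _ in toy]
labels = [label for _, label in toy]
accuracy = jackknife_accuracy(features, labels)
print(f"Jackknife accuracy on toy data: {accuracy:.2f}")
```

In a fuller pipeline, a feature ranking would reorder the columns of `features`, and the jackknife accuracy would be recomputed on growing prefixes of that ranking to choose the best-performing subset.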