期刊
PROTEIN AND PEPTIDE LETTERS
卷 19, 期 4, 页码 398-405出版社
BENTHAM SCIENCE PUBL LTD
DOI: 10.2174/092986612799789404
关键词
DNA-binding proteins; pseudo amino acid composition; random forest (RF); physicochemical character
资金
- National Natural Science Foundation of China [60803102, 61070084]
- NSFC [60496321]
DNA-binding proteins play an important role in most cellular processes, such as gene regulation, recombination, repair, replication, and DNA modification. In this article, an optimal Chou's pseudo amino acid composition (PseAAC) based on physicochemical characters of amino acid is proposed to represent proteins for identifying DNA-binding proteins. Six physicochemical characters of amino acids are utilized to generate the sequence features via the web server PseAAC. The optimal values of two important parameters (correlation factor lambda and weighting factor w) about PseAAC are determined to get the appropriate representation of proteins, which ultimately result in better prediction performance. Experimental results on the benchmark datasets using random forest show that our method is really promising to predict DNA-binding proteins and may at least be a useful supplement tool to existing methods.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据