4.7 Article

A novel method of protein secondary structure prediction with high segment overlap measure: Support vector machine approach

期刊

JOURNAL OF MOLECULAR BIOLOGY
卷 308, 期 2, 页码 397-407

出版社

ACADEMIC PRESS LTD
DOI: 10.1006/jmbi.2001.4580

关键词

protein structure prediction; protein secondary structure; support vector machine; supervised learning; the tertiary classifier

向作者/读者索取更多资源

We have introduced a new method of protein secondary structure prediction which is based on the theory of support vector machine (SVM). SVM represents a new approach to supervised pattern classification which has been successfully applied to a wide range of pattern recognition problems, including object recognition, speaker identification, gene function prediction with microarray expression profile, etc. Ln these cases, the performance of SVM either matches or is significantly better than that of traditional machine learning approaches, including neural networks. The first use of the SVM approach to predict protein secondary structure is described here. Unlike the previous studies, we first constructed several binary classifiers, then assembled a tertiary classifier for three secondary structure states (helix, sheet and coil) based on these binary classifiers. The SVM method achieved a good performance of segment overlap accuracy SOV = 76.2 % through sevenfold cross validation on a database of 513 non-homologous protein chains with multiple sequence alignments, which out-performs existing methods. Meanwhile three-state overall per-residue accuracy Q(3) achieved 73.5%, which is at least comparable to existing single prediction methods. Furthermore a useful reliability index for the predictions was developed, hn addition, SVM has many attractive features, including effective avoidance of overfitting, the ability to handle large feature spaces, information condensing of the given data set, etc. The SVM method is conveniently applied to many ether pattern classification tasks in biology. (C) 2001 Academic Press.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据