☆ 4.3 Article

Ranking predicted protein structures with support vector regression

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2008)

期刊

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

卷 71, 期 3, 页码 1175-1182

出版社

WILEY

DOI: 10.1002/prot.21809

关键词

protein structure prediction; scoring function; support vector regression; machine learning; consensus-based feature

类别

Biochemistry & Molecular Biology Biophysics

资金

NCRR NIH HHS [P41 RR11823, P41 RR011823] Funding Source: Medline
NHGRI NIH HHS [R33 HG003070] Funding Source: Medline
PHS HHS [R33 HGO03070] Funding Source: Medline

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Protein structure prediction is an important problem of both intellectual and practical interest. Most protein structure prediction approaches generate multiple candidate models first, and then use a scoring function to select the best model among these candidates. In this work, we develop a scoring function using support vector regression (SVR). Both consensus-based features and features from individual structures are extracted from a training data set containing native protein structures and predicted structural models submitted to CASP5 and CASP6. The SVR learns a scoring function that is a linear combination of these features. We test this scoring function on two data sets. First, when used to rank server models submitted to CASP7, the SVR score selects predictions that are comparable to the best performing server in CASP7, Zhang-Server, and significantly better than all the other servers. Even if the SVR score is not allowed to select Zhang-Server models, the SVR score still selects predictions that are significantly better than all the other servers. In addition, the SVR is able to select significantly better models and yield significantly better Pearson correlation coefficients than the two best Quality Assessment groups in CASP7, QA556 (LEE), and QA634 (Pcons). Second, this work aims to improve the ability of the Robetta server to select best models, and hence we evaluate the performance of the SVR score on ranking the Robetta server template-based models for the CASP7 targets. The SVR selects significantly better models than the Robetta K*Sync consensus alignment score.

Ranking predicted protein structures with support vector regression

期刊

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Ranking predicted protein structures with support vector regression

期刊

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文