☆ 4.3 Article

Optimal data collection for correlated mutation analysis

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2009)

期刊

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

卷 74, 期 3, 页码 545-555

出版社

WILEY

DOI: 10.1002/prot.22168

关键词

ab-initio structure prediction; correlated mutations; protein structure prediction; residue covariation; contact prediction

类别

Biochemistry & Molecular Biology Biophysics

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The main objective of correlated mutation analysis (CMA) is to predict intra-protein residue-residue interactions from sequence alone. Despite considerable progress in algorithms and computer capabilities, the performance of CMA methods remains quite low. Here we examine whether, and to what extent, the quality of CMA methods depends on the sequences that are included in the multiple sequence alignment (MSA). The results revealed a strong correlation between the number of homologs in an MSA and CMA prediction strength. Furthermore, many of the current methods include only orthologs in the MSA, we found that it is beneficial to include both orthologs and paralogs in the MSA. Remarkably, even remote homologs contribute to the improved accuracy. Based on our findings we put forward an automated data collection procedure, with a minimal coverage of 50% between the query protein and its orthologs and paralogs. This procedure improves accuracy even in the absence of manual curation. In this era of massive sequencing and exploding sequence data, our results suggest that correlated mutation-based methods have not reached their inherent performance limitations and that the role of CMA in structural biology is far from being fulfilled.

Optimal data collection for correlated mutation analysis

期刊

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Optimal data collection for correlated mutation analysis

期刊

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文