4.6 Article

Effective inter-residue contact definitions for accurate protein fold recognition

期刊

BMC BIOINFORMATICS
卷 13, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/1471-2105-13-292

关键词

Protein structure prediction; Threading; Fold recognition; Structural features; Residue-residue contact; Protein fold

资金

  1. National Institutes of Health [R01GM075004, R01GM097528]
  2. National Science Foundation [EF0850009, IIS0915801, DMS0800568]
  3. National Research Foundation of Korea
  4. Korean Government [NRF-2011-220-C00004]
  5. Direct For Biological Sciences
  6. Emerging Frontiers [0850009] Funding Source: National Science Foundation

向作者/读者索取更多资源

Background: Effective encoding of residue contact information is crucial for protein structure prediction since it has a unique role to capture long-range residue interactions compared to other commonly used scoring terms. The residue contact information can be incorporated in structure prediction in several different ways: It can be incorporated as statistical potentials or it can be also used as constraints in ab initio structure prediction. To seek the most effective definition of residue contacts for template-based protein structure prediction, we evaluated 45 different contact definitions, varying bases of contacts and distance cutoffs, in terms of their ability to identify proteins of the same fold. Results: We found that overall the residue contact pattern can distinguish protein folds best when contacts are defined for residue pairs whose C beta atoms are at 7.0 angstrom or closer to each other. Lower fold recognition accuracy was observed when inaccurate threading alignments were used to identify common residue contacts between protein pairs. In the case of threading, alignment accuracy strongly influences the fraction of common contacts identified among proteins of the same fold, which eventually affects the fold recognition accuracy. The largest deterioration of the fold recognition was observed for beta-class proteins when the threading methods were used because the average alignment accuracy was worst for this fold class. When results of fold recognition were examined for individual proteins, we found that the effective contact definition depends on the fold of the proteins. A larger distance cutoff is often advantageous for capturing spatial arrangement of the secondary structures which are not physically in contact. For capturing contacts between neighboring beta strands, considering the distance between C alpha atoms is better than the C beta-based distance because the side-chain of interacting residues on beta strands sometimes point to opposite directions. Conclusion: Residue contacts defined by C beta-C beta distance of 7.0 angstrom work best overall among tested to identify proteins of the same fold. We also found that effective contact definitions differ from fold to fold, suggesting that using different residue contact definition specific for each template will lead to improvement of the performance of threading.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据