4.6 Article

Capturing coevolutionary signals in repeat proteins

Journal

BMC BIOINFORMATICS
Volume 16, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s12859-015-0648-3

Keywords

Direct coupling analysis; Repeat proteins; Direct information; Co-evolution

Funding

  1. Consejo Nacional de Investigaciones Cientificas y Tecnicas de Argentina (CONICET)
  2. Agencia Nacional de Promocion Cientifica y Tecnologica (ANPCyT)
  3. ERCStG [306312]
  4. European Research Council (ERC) [306312] Funding Source: European Research Council (ERC)

Ask authors/readers for more resources

Background: The analysis of correlations of amino acid occurrences in globular domains has led to the development of statistical tools that can identify native contacts - portions of the chains that come to close distance in folded structural ensembles. Here we introduce a direct coupling analysis for repeat proteins - natural systems for which the identification of folding domains remains challenging. Results: We show that the inherent translational symmetry of repeat protein sequences introduces a strong bias in the pair correlations at precisely the length scale of the repeat-unit. Equalizing for this bias in an objective way reveals true co-evolutionary signals from which local native contacts can be identified. Importantly, parameter values obtained for all other interactions are not significantly affected by the equalization. We quantify the robustness of the procedure and assign confidence levels to the interactions, identifying the minimum number of sequences needed to extract evolutionary information in several repeat protein families. Conclusions: The overall procedure can be used to reconstruct the interactions at distances larger than repeat-pairs, identifying the characteristics of the strongest couplings in each family, and can be applied to any system that appears translationally symmetric.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available