4.7 Article

Extracting phylogenetic dimensions of coevolution reveals hidden functional signals

Journal

SCIENTIFIC REPORTS
Volume 12, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41598-021-04260-1

Keywords

-

Funding

  1. Stanford Graduate Fellowships
  2. Gerald J. Lieberman Fellowship
  3. European Research Council under the European Union's Horizon 2020 research and innovation programme [851173]
  4. NSF CAREER Award [MCB-1149328]
  5. Allen Discovery Center at Stanford on Systems Modeling of Infection
  6. European Research Council (ERC) [851173] Funding Source: European Research Council (ERC)

Ask authors/readers for more resources

This study introduces a background model that can separate the coevolution signal associated with different phylogenetic clades and within the same clade, and demonstrates that coevolution can be measured at multiple timescales within a protein. The study applies nested coevolution (NC) method to show the importance of poorly conserved residues in protein function and improves the accuracy of structural-contact predictions and functional sector detection.
Despite the structural and functional information contained in the statistical coupling between pairs of residues in a protein, coevolution associated with function is often obscured by artifactual signals such as genetic drift, which shapes a protein's phylogenetic history and gives rise to concurrent variation between protein sequences that is not driven by selection for function. Here, we introduce a background model for phylogenetic contributions of statistical coupling that separates the coevolution signal due to inter-clade and intra-clade sequence comparisons and demonstrate that coevolution can be measured on multiple phylogenetic timescales within a single protein. Our method, nested coevolution (NC), can be applied as an extension to any coevolution metric. We use NC to demonstrate that poorly conserved residues can nonetheless have important roles in protein function. Moreover, NC improved the structural-contact predictions of several coevolution-based methods, particularly in subsampled alignments with fewer sequences. NC also lowered the noise in detecting functional sectors of collectively coevolving residues. Sectors of coevolving residues identified after application of NC were more spatially compact and phylogenetically distinct from the rest of the protein, and strongly enriched for mutations that disrupt protein activity. Thus, our conceptualization of the phylogenetic separation of coevolution provides the potential to further elucidate relationships among protein evolution, function, and genetic diseases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available