4.8 Article

The Structural Determinants of Intra-Protein Compensatory Substitutions

Journal

MOLECULAR BIOLOGY AND EVOLUTION
Volume 39, Issue 4, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msac063

Keywords

molecular coevolution; compensatory mutations; epistasis; protein structure; evolutionary rate; substitution mapping

Funding

  1. Max Planck Society

Ask authors/readers for more resources

Compensatory substitutions occur when advantageous mutations restore fitness lost due to previous deleterious mutations. This study presents an atlas of intra-protein compensatory substitutions built using a phylogenetic approach and a dataset of 1,630 bacterial protein families. Through the analysis of evolutionary and structural properties, it is shown that compensatory mutations are rare but widespread in proteins. Coevolving residues are typically evolving slowly and located in the protein core, and they are more often in contact than expected by chance. The findings provide insights into the relationship between protein structure and fitness landscape.
Compensatory substitutions happen when one mutation is advantageously selected because it restores the loss of fitness induced by a previous deleterious mutation. How frequent such mutations occur in evolution and what is the structural and functional context permitting their emergence remain open questions. We built an atlas of intra-protein compensatory substitutions using a phylogenetic approach and a dataset of 1,630 bacterial protein families for which high-quality sequence alignments and experimentally derived protein structures were available. We identified more than 51,000 positions coevolving by the mean of predicted compensatory mutations. Using the evolutionary and structural properties of the analyzed positions, we demonstrate that compensatory mutations are scarce (typically only a few in the protein history) but widespread (the majority of proteins experienced at least one). Typical coevolving residues are evolving slowly, are located in the protein core outside secondary structure motifs, and are more often in contact than expected by chance, even after accounting for their evolutionary rate and solvent exposure. An exception to this general scheme is residues coevolving for charge compensation, which are evolving faster than noncoevolving sites, in contradiction with predictions from simple coevolutionary models, but similar to stem pairs in RNA. While sites with a significant pattern of coevolution by compensatory mutations are rare, the comparative analysis of hundreds of structures ultimately permits a better understanding of the link between the three-dimensional structure of a protein and its fitness landscape.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available