4.7 Article

Proteome-Wide Structural Computations Provide Insights into Empirical Amino Acid Substitution Matrices

期刊

出版社

MDPI
DOI: 10.3390/ijms24010796

关键词

amino acid substitution; fitness; genetic code; mutation; protein evolution; protein stability; replacement matrices; selection

向作者/读者索取更多资源

This study proposes a neutral continuous fitness-stability model inspired by the Arrhenius law to explain the observed amino acid substitution rates. The model suggests that the rate of substitution depends on a pre-exponential factor and an exponential term reflecting relative fitness. By analyzing stability changes in proteins and their disease-causing potential, the study found a significant positive correlation between energy values and the fitness effect of substitutions, supporting the validity of the model.
The relative contribution of mutation and selection to the amino acid substitution rates observed in empirical matrices is unclear. Herein, we present a neutral continuous fitness-stability model, inspired by the Arrhenius law (q(ij) = a(ij)e(-|delta delta Gij|)). The model postulates that the rate of amino acid substitution ( i -> j) is determined by the product of a pre-exponential factor, which is influenced by the genetic code structure, and an exponential term reflecting the relative fitness of the amino acid substitutions. To assess the validity of our model, we computed changes in stability of 14,094 proteins, for which 137,073,638 in silico mutants were analyzed. These site-specific data were summarized into a 20 square matrix, whose entries, delta delta G(ij) , were obtained after averaging through all the sites in all the proteins. We found a significant positive correlation between these energy values and the disease-causing potential of each substitution, suggesting that the exponential term accurately summarizes the fitness effect. A remarkable observation was that amino acids that were highly destabilizing when acting as the source, tended to have little effect when acting as the destination, and vice versa (source -> destination). The Arrhenius model accurately reproduced the pattern of substitution rates collected in the empirical matrices, suggesting a relevant role for the genetic code structure and a tuning role for purifying selection exerted via protein stability.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据