4.7 Article

A counting renaissance: combining stochastic mapping and empirical Bayes to quickly detect amino acid sites under positive selection

期刊

BIOINFORMATICS
卷 28, 期 24, 页码 3248-3256

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bts580

关键词

-

资金

  1. European Research Council under the European Community [278433-PREDEMICS]
  2. ERC [260864]
  3. National Science Foundation [DMS 0856099]
  4. National Institutes of Health [R01 GM086887, R01 HG006139]
  5. Bioinformatics, Statistical Analysis and Evolutionary Core of the UCSD Center for AIDS Research [5P30AI36214]
  6. National Evolutionary Synthesis Center (NESCent)
  7. Division Of Mathematical Sciences
  8. Direct For Mathematical & Physical Scien [0856099] Funding Source: National Science Foundation

向作者/读者索取更多资源

MOTIVATION: Statistical methods for comparing relative rates of synonymous and non-synonymous substitutions maintain a central role in detecting positive selection. To identify selection, researchers often estimate the ratio of these relative rates ( ) at individual alignment sites. Fitting a codon substitution model that captures heterogeneity in across sites provides a reliable way to perform such estimation, but it remains computationally prohibitive for massive datasets. By using crude estimates of the numbers of synonymous and non-synonymous substitutions at each site, counting approaches scale well to large datasets, but they fail to account for ancestral state reconstruction uncertainty and to provide site-specific estimates. RESULTS: We propose a hybrid solution that borrows the computational strength of counting methods, but augments these methods with empirical Bayes modeling to produce a relatively fast and reliable method capable of estimating site-specific values in large datasets. Importantly, our hybrid approach, set in a Bayesian framework, integrates over the posterior distribution of phylogenies and ancestral reconstructions to quantify uncertainty about site-specific estimates. Simulations demonstrate that this method competes well with more-principled statistical procedures and, in some cases, even outperforms them. We illustrate the utility of our method using human immunodeficiency virus, feline panleukopenia and canine parvovirus evolution examples.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据