4.8 Article

Flexible Mixture Model Approaches That Accommodate Footprint Size Variability for Robust Detection of Balancing Selection

期刊

MOLECULAR BIOLOGY AND EVOLUTION
卷 37, 期 11, 页码 3267-3291

出版社

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msaa134

关键词

balancing selection; mixture model; likelihood ratio statistic; bonobo; multiallelic balancing selection

资金

  1. Pennsylvania State University
  2. National Institutes of Health [R35-GM128590]
  3. National Science Foundation [DEB-1753489, DEB-1949268, BCS-2001063]
  4. Alfred P. Sloan Foundation

向作者/读者索取更多资源

Long-term balancing selection typically leaves narrow footprints of increased genetic diversity, and therefore most detection approaches only achieve optimal performances when sufficiently small genomic regions (i.e., windows) are examined. Such methods are sensitive to window sizes and suffer substantial losses in power when windows are large. Here, we employ mixture models to construct a set of five composite likelihood ratio test statistics, which we collectively term B statistics. These statistics are agnostic to window sizes and can operate on diverse forms of input data. Through simulations, we show that they exhibit comparable power to the best-performing current methods, and retain substantially high power regardless of window sizes. They also display considerable robustness to high mutation rates and uneven recombination landscapes, as well as an array of other common confounding scenarios. Moreover, we applied a specific version of the B statistics, termed B-2, to a human population-genomic data set and recovered many top candidates from prior studies, including the then-uncharacterized STPG2 and CCDC169-SOHLH2, both of which are related to gamete functions. We further applied B2 on a bonobo population-genomic data set. In addition to the MHC-DQ genes, we uncovered several novel candidate genes, such as KLRD1, involved in viral defense, and SCN9A, associated with pain perception. Finally, we show that our methods can be extended to account formultiallelic balancing selection and integrated the set of statistics into open-source software named BalLeRMix for future applications by the scientific community.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据