4.6 Article

Learning the properties of adaptive regions with functional data analysis

期刊

PLOS GENETICS
卷 16, 期 8, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pgen.1008896

关键词

-

资金

  1. National Institutes of Health [R35GM128590]
  2. National Science Foundation [DEB-1753489, DEB-1949268, BCS-2001063]
  3. Alfred P. Sloan Foundation
  4. NIGMS [T32GM102057]
  5. NASA Pennsylvania Space Grant Graduate Fellowship
  6. Graduate Research Innovation Grant from the Huck Institutes of the Life Sciences
  7. National Human Genome Research Institute predoctoral fellowship [1F31HG010574-01]

向作者/读者索取更多资源

Author summary As populations adapt to their environments, specific patterns indicating selection remain in the distribution of genetic diversity across their genomes. A hallmark of positive natural selection is the reduction of genetic diversity surrounding beneficial mutations. The origin of the beneficial mutation, or whether it originated in a population being examined or within another, can be uncovered through the spatial distribution of the reduction of genetic diversity. In addition, other information about the strength, timing, and initial frequency of beneficial mutations can be learned by examining patterns of diversity across genomic regions. We use functional data analysis to capture differences among the spatial distributions of genetic variation expected by diverse evolutionary processes, and further apply it to dissect how selection parameters affect such patterns. Using this method, we learn the underlying origins, timings, and strengths of beneficial mutations that have impacted modern human genomic diversity. Identifying regions of positive selection in genomic data remains a challenge in population genetics. Most current approaches rely on comparing values of summary statistics calculated in windows. We present an approach termedSURFDAWave, which translates measures of genetic diversity calculated in genomic windows to functional data. By transforming our discrete data points to be outputs of continuous functions defined over genomic space, we are able to learn the features of these functions that signify selection. This enables us to confidently identify complex modes of natural selection, including adaptive introgression. We are also able to predict important selection parameters that are responsible for shaping the inferred selection events. By applying our model to human population-genomic data, we recapitulate previously identified regions of selective sweeps, such asOCA2in Europeans, and predict that its beneficial mutation reached a frequency of 0.02 before it swept 1,802 generations ago, a time when humans were relatively new to Europe. In addition, we identifyBNC2in Europeans as a target of adaptive introgression, and predict that it harbors a beneficial mutation that arose in an archaic human population that split from modern humans within the hypothesized modern human-Neanderthal divergence range.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据