4.4 Article

A Hierarchical Bayesian Model for Next-Generation Population Genomics

Journal

GENETICS
Volume 187, Issue 3, Pages 903-917

Publisher

GENETICS SOCIETY AMERICA
DOI: 10.1534/genetics.110.124693

Keywords

-

Funding

  1. National Science Foundation Division of Biological Infrastructure (DBI) [0701757]
  2. Direct For Biological Sciences
  3. Division Of Environmental Biology [1050726, 1050149, 1011173, 1050947, 1050355] Funding Source: National Science Foundation
  4. Direct For Biological Sciences
  5. Division Of Integrative Organismal Systems [0701757] Funding Source: National Science Foundation

Ask authors/readers for more resources

The demography of populations and natural selection shape genetic variation across the genome and understanding the genomic consequences of these evolutionary processes is a fundamental aim of population genetics. We have developed a hierarchical Bayesian model to quantify genome-wide population structure and identify candidate genetic regions affected by selection. This model improves on existing methods by accounting for stochastic sampling of sequences inherent in next-generation sequencing (with pooled or indexed individual samples) and by incorporating genetic distances among haplotypes in measures of genetic differentiation. Using simulations we demonstrate that this model has a low falsepositive rate for classifying neutral genetic regions as selected genes (i.e., fST outliers), but can detect recent selective sweeps, particularly when genetic regions in multiple populations are affected by selection. Nonetheless, selection affecting just a single population was difficult to detect and resulted in a high falsenegative rate under certain conditions. We applied the Bayesian model to two large sets ofhuman population genetic data. We found evidence of widespread positive and balancing selection among worldwide human populations, including many genetic regions previously thought to be under selection. Additionally, we identified novel candidate genes for selection, several of which have been linked to human diseases. This model will facilitate the population genetic analysis of a wide range of organisms on the basis of nextgeneration sequence data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available