4.3 Article

Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories

期刊

G3-GENES GENOMES GENETICS
卷 7, 期 11, 页码 3605-3620

出版社

OXFORD UNIV PRESS INC
DOI: 10.1534/g3.117.300259

关键词

pairwise sequentially; Markovian coalescent; site frequency spectrum; population genetics; demographic inference; nonmodel organisms

资金

  1. National Institutes of Health (NIH) [R35 GM-119856]
  2. NIH Training Grant in Genomic Analysis and Interpretation [T32 HG-002536]
  3. National Science Foundation Graduate Research Fellowship Program
  4. NIH under the Ruth L. Kirschestein National Research Service Award [T32 GM-008185]
  5. Direct For Biological Sciences [1556705] Funding Source: National Science Foundation
  6. Division Of Environmental Biology [1556705] Funding Source: National Science Foundation

向作者/读者索取更多资源

Inference of demographic history from genetic data is a primary goal of population genetics of KEYWORDS model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple pairwise sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the sequentially demographic history of an entire population, while site frequency spectrum (SFS)-based methods use the Markovian distribution of allele frequencies in a sample to reconstruct the same historical events. Although both coalescent methods are extensively used in empirical studies and perform well on data simulated under simple models, site frequency there have been only limited comparisons of them in more complex and realistic settings. Here we use spectrum published demographic models based on data from three human populations (Yoruba, descendants of population northwest-Europeans, and Han Chinese) as an empirical test case to study the behavior of both inference genetics procedures. We find that several of the demographic histories inferred by the whole genome-based demographic methods do not predict the genome-wide distribution of heterozygosity, nor do they predict the empirical inference SFS. However, using simulated data, we also find that the whole genome methods can reconstruct the nonmodel complex demographic models inferred by SFS-based methods, suggesting that the discordant patterns of organisms genetic variation are not attributable to a lack of statistical power, but may reflect unmodeled complexities in the underlying demography. More generally, our findings indicate that demographic inference from a small number of genomes, routine in genomic studies of nonmodel organisms, should be interpreted cautiously, as these models cannot recapitulate other summaries of the data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据