4.4 Article

Quantifying Population Genetic Differentiation from Next-Generation Sequencing Data

期刊

GENETICS
卷 195, 期 3, 页码 979-+

出版社

GENETICS SOCIETY AMERICA
DOI: 10.1534/genetics.113.154740

关键词

next-generation sequencing; F-ST; principal components analysis

资金

  1. EMBO Long-Term Post-doctoral Fellowship [ALTF 229-2011]
  2. National Institutes of Health (NIH) Genomics Training Grant [T32HG000047-13]
  3. National Science Foundation [DBI-0906065]
  4. NIH [3R01HG03229-08S2, 3R01HG03229-07]
  5. Villum Fonden [00007171] Funding Source: researchfish

向作者/读者索取更多资源

Over the past few years, new high-throughput DNA sequencing technologies have dramatically increased speed and reduced sequencing costs. However, the use of these sequencing technologies is often challenged by errors and biases associated with the bioinformatical methods used for analyzing the data. In particular, the use of naive methods to identify polymorphic sites and infer genotypes can inflate downstream analyses. Recently, explicit modeling of genotype probability distributions has been proposed as a method for taking genotype call uncertainty into account. Based on this idea, we propose a novel method for quantifying population genetic differentiation from next-generation sequencing data. In addition, we present a strategy for investigating population structure via principal components analysis. Through extensive simulations, we compare the new method herein proposed to approaches based on genotype calling and demonstrate a marked improvement in estimation accuracy for a wide range of conditions. We apply the method to a large-scale genomic data set of domesticated and wild silkworms sequenced at low coverage. We find that we can infer the fine-scale genetic structure of the sampled individuals, suggesting that employing this new method is useful for investigating the genetic relationships of populations sampled at low coverage.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据