4.4 Article

Estimation of Allele Frequencies From High-Coverage Genome-Sequencing Projects

Journal

GENETICS
Volume 182, Issue 1, Pages 295-301

Publisher

GENETICS SOCIETY AMERICA
DOI: 10.1534/genetics.109.100479

Funding

  1. National Science Foundation [EF-0827411]
  2. National Institutes of Health [GM36827]
  3. Lilly Foundation

A new generation of high-throughput sequencing strategies will soon lead to the acquisition of high-coverage genomic profiles of hundreds to thousands of individuals within species, generating unprecedented levels of information on the frequencies of nucleotides segregating at individual sites. However, because these new technologies are error prone and yield uneven coverage of alleles in diploid individuals, they also introduce the need for novel methods for analyzing the raw read data. A maximum-likelihood method for the estimation of allele frequencies is developed, eliminating both the need to arbitrarily discard individuals with low coverage and the requirement for an extrinsic measure of the sequence error rate. The resultant estimates are nearly unbiased with asymptotically minimal sampling variance, thereby defining the limits to our ability to estimate population-genetic parameters and providing a logical basis for the optimal design of population-genomic surveys.
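
The general idea of jointly estimating an allele frequency and a sequencing error rate from raw read counts can be illustrated with a small sketch. This is not the estimator from the paper: it assumes a simplified model with two alleles (A and a), Hardy-Weinberg genotype proportions, a symmetric per-read error rate, and a plain grid search in place of a proper optimizer. The function names (`binom_pmf`, `log_likelihood`, `estimate`), the grid resolution, and the upper bound on the error rate are all hypothetical choices made for illustration.

```python
import math

def binom_pmf(k, n, q):
    """Probability of k successes in n trials with success probability q."""
    return math.comb(n, k) * q**k * (1 - q)**(n - k)

def log_likelihood(p, eps, reads):
    """Log-likelihood of allele frequency p and error rate eps.

    reads is a list of (k, n) pairs, one per individual:
    k reads showing allele A out of n reads covering the site.
    Assumed model: genotypes AA, Aa, aa occur with Hardy-Weinberg
    frequencies, and each read is misreported as the other allele
    with probability eps.
    """
    total = 0.0
    for k, n in reads:
        lik = (p * p * binom_pmf(k, n, 1 - eps)           # AA individual
               + 2 * p * (1 - p) * binom_pmf(k, n, 0.5)   # Aa individual
               + (1 - p) ** 2 * binom_pmf(k, n, eps))     # aa individual
        total += math.log(lik)
    return total

def estimate(reads, steps=100, max_eps=0.1):
    """Grid search for the (p, eps) pair with the highest log-likelihood.

    p is searched over (0, 1) and eps over (0, max_eps), both on
    open grids so that the likelihood is always strictly positive.
    """
    best_ll, best_p, best_eps = -math.inf, None, None
    for i in range(1, steps):
        p = i / steps
        for j in range(1, steps):
            eps = max_eps * j / steps
            ll = log_likelihood(p, eps, reads)
            if ll > best_ll:
                best_ll, best_p, best_eps = ll, p, eps
    return best_p, best_eps

# Hypothetical data: uneven coverage across six individuals,
# including two covered by only a few reads.
example_reads = [(12, 12), (8, 8), (2, 2), (6, 11), (1, 3), (0, 9)]
p_hat, eps_hat = estimate(example_reads)
print(f"estimated frequency of A: {p_hat:.2f}, estimated error rate: {eps_hat:.3f}")
```

Because each individual contributes a mixture over its three possible genotypes, low-coverage individuals stay in the analysis and are simply less informative, and the error rate is inferred from the same read data rather than supplied externally.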
