4.4 Article

A Coalescent Model for Genotype Imputation

Journal

GENETICS
Volume 191, Issue 4, Pages 1239-1255

Publisher

GENETICS SOC AM
DOI: 10.1534/genetics.111.137984

Keywords

-

Funding

  1. National Institutes of Health [R01 GM081441, R01 HG005855, T32 HG000040]
  2. Burroughs Wellcome Fund
  3. Division Of Integrative Organismal Systems
  4. Direct For Biological Sciences [0920231] Funding Source: National Science Foundation

Ask authors/readers for more resources

The potential for imputed genotypes to enhance an analysis of genetic data depends largely on the accuracy of imputation, which in turn depends on properties of the reference panel of template haplotypes used to perform the imputation. To provide a basis for exploring how properties of the reference panel affect imputation accuracy theoretically rather than with computationally intensive imputation experiments, we introduce a coalescent model that considers imputation accuracy in terms of population-genetic parameters. Our model allows us to investigate sampling designs in the frequently occurring scenario in which imputation targets and templates are sampled from different populations. In particular, we derive expressions for expected imputation accuracy as a function of reference panel size and divergence time between the reference and target populations. We find that a modestly sized internal reference panel from the same population as a target haplotype yields, on average, greater imputation accuracy than a larger external panel from a different population, even if the divergence time between the two populations is small. The improvement in accuracy for the internal panel increases with increasing divergence time between the target and reference populations. Thus, in humans, our model predicts that imputation accuracy can be improved by generating small population-specific custom reference panels to augment existing collections such as those of the HapMap or 1000 Genomes Projects. Our approach can be extended to understand additional factors that affect imputation accuracy in complex population-genetic settings, and the results can ultimately facilitate improvements in imputation study designs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available