4.7 Article

Genotype-Imputation Accuracy across Worldwide Human Populations

Journal

AMERICAN JOURNAL OF HUMAN GENETICS
Volume 84, Issue 2, Pages 235-250

Publisher

CELL PRESS
DOI: 10.1016/j.ajhg.2009.01.013

Keywords

-

Funding

  1. National Institutes of Health [R01 GM081441, R01 HL090564, U01 HL084729]
  2. Burroughs Wellcome Fund
  3. Alfred P. Sloan Foundation
  4. National Institute on Aging, National Institutes of Health, Department of Health and Human Services [Z01AG000949-02]
  5. MRC [G0701075] Funding Source: UKRI
  6. Medical Research Council [G0701075] Funding Source: researchfish
  7. Parkinson's UK [G-0907] Funding Source: researchfish

Ask authors/readers for more resources

A Current approach to mapping complex-disease-susceptibility loci in genome-wide association (GWA) studies involves leveraging the information in a reference database of dense genotype data. By modeling the patterns of linkage disequilibrium in a reference panel, genotypes not directly measured In the study samples can be imputed and tested for disease association. This imputation strategy has been successful for GWA studies in populations well represented by existing reference panels. We used genotypes at 513,008 autosomal single-nucleotide polymorphism (SNP) loci in 443 unrelated individuals from 29 worldwide populations to evaluate the portability of the HapMap reference panels for imputation in studies of diverse populations. When a single HapMap panel was leveraged for imputation of randomly masked genotypes, European populations had the highest imputation accuracy, followed by populations from East Asia, Central and South Asia, the Americas, Oceania, the Middle East, and Africa. For each population, we identified optimal mixtures of reference panels that maximized imputation accuracy, and we found that in most populations, mixtures including individuals from at least two HapMap panels produced the highest imputation accuracy. From a separate survey of additional SNPs typed in the same samples, we evaluated imputation accuracy in the scenario in which all genotypes at a given SNP position were unobserved and were imputed on the basis of data from a commercial SNP chip, again finding that most populations benefited from the use of combinations of two or more HapMap reference panels. Our results can serve as a guide for selecting appropriate reference panels for imputation-based GWA analysis in diverse populations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available