Article, Proceedings Paper

Population Stratification and Patterns of Linkage Disequilibrium

Journal

GENETIC EPIDEMIOLOGY
Volume 33, Supplement 1, Pages S88-S92

Publisher

WILEY
DOI: 10.1002/gepi.20478

Keywords

genetic association; genome-wide association study; principal components; multidimensional scaling; ethnic substructure

Funding

  1. NATIONAL CENTER FOR RESEARCH RESOURCES [KL2RR024990] Funding Source: NIH RePORTER
  2. NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM031575] Funding Source: NIH RePORTER
  3. NATIONAL INSTITUTE ON ALCOHOL ABUSE AND ALCOHOLISM [K01AA015572] Funding Source: NIH RePORTER
  4. NCRR NIH HHS [KL2 RR024990] Funding Source: Medline
  5. NIAAA NIH HHS [AA015572, K01 AA015572, K01 AA015572-04] Funding Source: Medline
  6. NIGMS NIH HHS [R01 GM031575-25, R01 GM031575] Funding Source: Medline

Although the importance of selecting cases and controls from the same population has been recognized for decades, the recent advent of genome-wide association studies has heightened awareness of this issue. Because these studies typically deal with large samples, small differences in allele frequencies between cases and controls can easily reach statistical significance. When, unbeknownst to a researcher, cases and controls have different substructures, the number of false-positive findings is inflated. Three purely statistical approaches to assessing the ancestral comparability of case and control samples have recently been developed: genomic control, structured association, and multivariate reduction analyses. The widespread use of high-throughput technology has allowed the quick and accurate genotyping of the large number of markers required by these methods. Group 13 dealt with four population stratification issues: single-nucleotide polymorphism marker selection, association testing, nonstandard methods, and linkage disequilibrium calculations in stratified or mixed-ethnicity samples. We demonstrated that there are continuous axes of ethnic variation in both data sets of Genetic Analysis Workshop 16. Furthermore, ignoring this structure created P-value inflation for a variety of phenotypes. Components from principal-components analysis (or multidimensional scaling) can control this inflation when included as covariates in a logistic regression. One can also weight for local ancestry estimates and allow the use of related individuals. Problems arise in the presence of extremely high association or unusually strong linkage disequilibrium (e.g., in chromosomal inversions). Our group also reported a method for performing an association test that controls for substructure when genome-wide markers are not available to explicitly compute stratification. Genet. Epidemiol. 33 (Suppl. 1):S88-S92, 2009. © 2009 Wiley-Liss, Inc.
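
Of the three approaches named in the abstract, genomic control is the simplest to illustrate. The sketch below is not the authors' code. It assumes one common formulation in which the inflation factor (lambda) is the median of the observed 1-df chi-square association statistics divided by the median expected under the null (about 0.4549), and each statistic is then divided by lambda. The function names and example values are hypothetical and chosen only for illustration.

```python
from statistics import median

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def inflation_factor(chi2_stats):
    """Genomic-control inflation factor: observed median / expected null median."""
    return median(chi2_stats) / CHI2_1DF_MEDIAN


def genomic_control_adjust(chi2_stats):
    """Divide each statistic by lambda.

    Lambda is floored at 1.0 (an assumption of this sketch) so that
    statistics are only ever deflated, never inflated.
    """
    lam = max(1.0, inflation_factor(chi2_stats))
    return [s / lam for s in chi2_stats]


# Hypothetical per-marker 1-df association statistics (illustrative values only).
example_stats = [0.2, 0.5, 0.9, 1.4, 3.1]
lam = inflation_factor(example_stats)
adjusted = genomic_control_adjust(example_stats)
```

A lambda well above 1 suggests that the test statistics are inflated across the genome, which is the pattern expected when cases and controls differ in ancestry. After adjustment, the median of the corrected statistics matches the null median.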
