4.7 Article

Genomic Dissection of Population Substructure of Han Chinese and Its Implication in Association Studies

Journal

AMERICAN JOURNAL OF HUMAN GENETICS
Volume 85, Issue 6, Pages 762-774

Publisher

CELL PRESS
DOI: 10.1016/j.ajhg.2009.10.015

Keywords

-

Funding

  1. National Outstanding Youth Science Foundation of China [30625016]
  2. National Science Foundation of China [30890034, 30971577]
  3. 863 Program [2007AA02Z312]
  4. Shanghai Leading Academic Discipline Project [B111]
  5. Center for Evolutionary Biology
  6. Science and Technology Commission of Shanghai Municipality [09ZRI436400]
  7. Knowledge Innovation Program of Shanghai Institutes for Biological Sciences
  8. Chinese Academy of Sciences [2008KIP311]
  9. SA-SIBS Scholarship Program and the K.C.Wong Education Foundation, Hong Kong

Ask authors/readers for more resources

To date, most genome-wide association studies (GWAS) and studies of fine-scale population structure have been conducted primarily on Europeans. Han Chinese, the largest ethnic group in the world, composing 20% of the entire global human population, is largely under-represented in such studies. A well-recognized challenge is the fact that population structure can cause spurious associations in GWAS. In this study, we examined population substructures in a diverse set of over 1700 Han Chinese samples collected from 26 regions across China, each genotyped at similar to 160K single-nucleotide polymorphisms (SNPs). Our results showed that the Han Chinese population is intricately substructured, with the main observed clusters corresponding roughly to northern Han, central Han, and southern Han. However, simulated case-control studies showed that genetic differentiation among these clusters, although very small (F-ST = 0.0002 similar to 0.0009), is sufficient to lead to an inflated rate of false-positive results even when the sample size is moderate. The top two SNPs with the greatest frequency differences between the northern Han and southern Han clusters (F-ST > 0.06) were found in the FADS2 gene, which associates with the fatty acid composition in phospholipids, and in the HLA complex PS gene (HCP5), which associates with HIV infection, psoriasis, and psoriatic arthritis. Ingenuity Pathway Analysis (IPA) showed that most differentiated genes among clusters are involved in cardiac arteriopathy (p < 10(-101)). These signals indicating significant differences among Han Chinese subpopulations should be carefully explained in case they are also detected in association studies, especially when sample sources are diverse.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available