4.2 Article

The utility of empirically assigning ancestry groups in cross-population genetic studies of addiction

Journal

AMERICAN JOURNAL ON ADDICTIONS
Volume 26, Issue 5, Pages 494-501

Publisher

WILEY
DOI: 10.1111/ajad.12586

Keywords

-

Funding

  1. National Institute on Alcohol Abuse and Alcoholism [R37 AA011408]
  2. National Institutes of Health [P20 AA107828, P50 AA022537]
  3. Virginia Commonwealth University
  4. National Center for Research Resources [UL1 RR031990]
  5. National Institutes of Health Roadmap for Medical Research
  6. NIH [AA022537, MH020030]
  7. [K02 AA018755]
  8. [AA021399]

Ask authors/readers for more resources

Background and ObjectivesGiven moderate heritability and significant heterogeneity among addiction phenotypes, successful genome-wide association studies (GWAS) are expected to need very large samples. As sample sizes grow, so can genetic diversity leading to challenges in analyzing these data. Methods for empirically assigning individuals to genetically informed ancestry groups are needed. MethodsWe describe a strategy for empirically assigning ancestry groups in ethnically diverse GWAS data including extensions of principal component analysis (PCA) and population matching through minimum Mahalanobis distance. We apply these methods to data from Spit for Science (S4S): the University Student Survey, a study following college students longitudinally that includes genetic and environmental data on substance use and mental health (n=7,603). ResultsThe genetic-based population assignments for S4S were 48.7% European, 22.5% African, 10.4% Americas, 9.2% East Asian, and 9.2% South Asian descent. Self-reported census categories More than one race and Unknownas well as Hawaiian/Pacific Islander and American-Indian/Native Alaskan were empirically assigned representing a +9% sample retention over conventional methods. Although there was high concordance between self-reported race and empirical population-match (+.924), there was reduction in variance for most ancestry PCs for genetic-based population assignments. ConclusionsWe were able to create more genetically homogenous groups and reduce sample and marker loss through cross-ancestry meta-analysis, potentially increasing power to detect etiologically relevant variation. Our approach provides a framework for empirically assigning genetic ancestry groups which can be applied to other ethnically diverse genetic studies. Scientific SignificanceGiven the important public health impact and demonstrable gains in statistical power from studying diverse populations, empirically sound practices for genetic studies are needed.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available