4.5 Article

A standardized framework for representation of ancestry data in genomics studies, with application to the NHGRI-EBI GWAS Catalog

Journal

GENOME BIOLOGY
Volume 19, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s13059-018-1396-2

Keywords

Genomics; Genome-wide association studies; GWAS Catalog; Ancestry; Diversity; Population genetics

Funding

  1. National Human Genome Research Institute
  2. National Institute of General Medical Sciences of the National Institutes of Health [U41-HG007823, U41-HG006104]
  3. European Molecular Biology Laboratory

Ask authors/readers for more resources

The accurate description of ancestry is essential to interpret, access, and integrate human genomics data, and to ensure that these benefit individuals from all ancestral backgrounds. However, there are no established guidelines for the representation of ancestry information. Here we describe a framework for the accurate and standardized description of sample ancestry, and validate it by application to the NHGRI-EBI GWAS Catalog. We confirm known biases and gaps in diversity, and find that African and Hispanic or Latin American ancestry populations contribute a disproportionately high number of associations. It is our hope that widespread adoption of this framework will lead to improved analysis, interpretation, and integration of human genomics data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available