4.7 Article

MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects

Journal

BIOINFORMATICS
Volume 27, Issue 15, Pages 2104-2111

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btr323

Keywords

-

Funding

  1. Netherlands Genomics Initiative [93519031]
  2. Netherlands Organization for Scientific Research (NWO, ZonMW [916.10.135]
  3. European Community [FP7/2007-2013, 259867]

Ask authors/readers for more resources

Motivation: Sample mix-ups can arise during sample collection, handling, genotyping or data management. It is unclear how often sample mix-ups occur in genome-wide studies, as there currently are no post hoc methods that can identify these mix-ups in unrelated samples. We have therefore developed an algorithm (MixupMapper) that can both detect and correct sample mix-ups in genome-wide studies that study gene expression levels. Results: We applied MixupMapper to five publicly available human genetical genomics datasets. On average, 3% of all analyzed samples had been assigned incorrect expression phenotypes: in one of the datasets 23% of the samples had incorrect expression phenotypes. The consequences of sample mix-ups are substantial: when we corrected these sample mix-ups, we identified on average 15% more significant cis-expression quantitative trait loci (cis-eQTLs). In one dataset, we identified three times as many significant cis-eQTLs after correction. Furthermore, we show through simulations that sample mix-ups can lead to an underestimation of the explained heritability of complex traits in genome-wide association datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available