4.6 Article

kruX: matrix-based non-parametric eQTL discovery

Journal

BMC BIOINFORMATICS
Volume 15, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/1471-2105-15-11

Keywords

eQTL; Non-parametric methods; Matrix algebra; Software

Funding

  1. Freiburg Institute for Advanced Studies
  2. BBSRC
  3. Biotechnology and Biological Sciences Research Council [BBS/E/D/20211553, BBS/E/D/20211551, BBS/E/D/20211552] Funding Source: researchfish
  4. BBSRC [BBS/E/D/20211553, BBS/E/D/20211552, BBS/E/D/20211551] Funding Source: UKRI

Ask authors/readers for more resources

Background: The Kruskal-Wallis test is a popular non-parametric statistical test for identifying expression quantitative trait loci (eQTLs) from genome-wide data due to its robustness against variations in the underlying genetic model and expression trait distribution, but testing billions of marker-trait combinations one-by-one can become computationally prohibitive. Results: We developed kruX, an algorithm implemented in Matlab, Python and R that uses matrix multiplications to simultaneously calculate the Kruskal-Wallis test statistic for several millions of marker-trait combinations at once. KruX is more than ten thousand times faster than computing associations one-by-one on a typical human dataset. We used kruX and a dataset of more than 500k SNPs and 20k expression traits measured in 102 human blood samples to compare eQTLs detected by the Kruskal-Wallis test to eQTLs detected by the parametric ANOVA and linear model methods. We found that the Kruskal-Wallis test is more robust against data outliers and heterogeneous genotype group sizes and detects a higher proportion of non-linear associations, but is more conservative for calling additive linear associations. Conclusion: kruX enables the use of robust non-parametric methods for massive eQTL mapping without the need for a high-performance computing infrastructure and is freely available from http://krux.googlecode.com.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available