4.7 Article

Focused goodness of fit tests for gene set analyses

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 1, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab472

Keywords

gene set; goodness-of-fit test; genetic association; rare variants; amyotrophic lateral sclerosis

Ask authors/readers for more resources

Gene set-based signal detection analysis is used to detect the association between a trait and a set of genes by accumulating signals across the genes in the set. This study presents a flexible framework based on tail-focused GOF statistics, which depends on two critical parameters. Guidance on statistic selection is provided and the methods are implemented in the user-friendly R package wHC. The methods are applied to a study on amyotrophic lateral sclerosis.
Gene set-based signal detection analyses are used to detect an association between a trait and a set of genes by accumulating signals across the genes in the gene set. Since signal detection is concerned with identifying whether any of the genes in the gene set are non-null, a goodness-of-fit (GOF) test can be used to compare whether the observed distribution of gene-level tests within the gene set agrees with the theoretical null distribution. Here, we present a flexible gene set-based signal detection framework based on tail-focused GOF statistics. We show that the power of the various statistics in this framework depends critically on two parameters: the proportion of genes within the gene set that are non-null and the degree of separation between the null and alternative distributions of the gene-level tests. We give guidance on which statistic to choose for a given situation and implement the methods in a fast and user-friendly R package, wHC (https://github.com/mqzhanglab/wHC). Finally, we apply these methods to a whole exome sequencing study of amyotrophic lateral sclerosis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available