4.6 Article

Random-effects meta-analysis of effect sizes as a unified framework for gene set analysis

Journal

PLOS COMPUTATIONAL BIOLOGY
Volume 18, Issue 10, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1010278

Keywords

-

Funding

  1. Science Foundation Ireland [16/IA/4612, 18/CRT/6214]
  2. NIH from NHGRI [K99/R00, 5R00HG009679-03]
  3. R35 award from NIGMS [1R35GM138293-01]
  4. Science Foundation Ireland (SFI) [16/IA/4612, 18/CRT/6214] Funding Source: Science Foundation Ireland (SFI)

Ask authors/readers for more resources

Gene set analysis (GSA) is a common step in genome-scale studies that can reveal insights not apparent from individual gene analysis. This study presents a unified framework for GSA that fits effect size distributions and tests for differences between gene sets. The approach takes into account uncertainty in effect size estimates and provides significant gains in performance over existing methods, as demonstrated by simulation and real data analysis.
Gene set analysis (GSA) remains a common step in genome-scale studies because it can reveal insights that are not apparent from results obtained for individual genes. Many different computational tools are applied for GSA, which may be sensitive to different types of signals; however, most methods implicitly test whether there are differences in the distribution of the effect of some experimental condition between genes in gene sets of interest. We have developed a unifying framework for GSA that first fits effect size distributions, and then tests for differences in these distributions between gene sets. These differences can be in the proportions of genes that are perturbed or in the sign or size of the effects. Inspired by statistical meta-analysis, we take into account the uncertainty in effect size estimates by reducing the influence of genes with greater uncertainty on the estimation of distribution parameters. We demonstrate, using simulation and by application to real data, that this approach provides significant gains in performance over existing methods. Furthermore, the statistical tests carried out are defined in terms of effect sizes, rather than the results of prior statistical tests measuring these changes, which leads to improved interpretability and greater robustness to variation in sample sizes. Author summary The role of gene set analysis is to identify groups of genes that are perturbed in a genomics experiment. There are many tools available for this task and they do not all test for the same types of changes. Here we propose a new way to carry out gene set analysis that involves first working out the distribution of the group effect in the gene set and then comparing this distribution to the equivalent distribution in other genes. Tests performed by existing tools for gene set analysis can be related to different comparisons in these distributions of group effects. A unified framework for gene set analysis provides for more explicit null hypotheses against which to test sets of genes for different types of responses to the experimental conditions. These results are more interpretable, because the group effect distributions can be compared visually, providing an indication of how the experimental effect differs between the gene sets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available