4.6 Article

MetaGSCA: A tool for meta-analysis of gene set differential coexpression

Journal

PLOS COMPUTATIONAL BIOLOGY
Volume 17, Issue 5, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1008976

Keywords

-

Funding

  1. NCI NIH HHS [P50 CA098131, P30 CA068485, P30 CA118100] Funding Source: Medline

Ask authors/readers for more resources

The MetaGSCA tool allows for comprehensive meta-analyses of gene set differential coexpression data, identifying relevant pathways and visualizing them. The tool demonstrated its effectiveness in case studies of chronic kidney disease and non-small cell lung cancer, as well as in a pan-cancer analysis of 11 cancer types. Analysis with randomly generated gene sets showed low false positive rates, indicating the tool's specificity.
Analyses of gene set differential coexpression may shed light on molecular mechanisms underlying phenotypes and diseases. However, differential coexpression analyses of conceptually similar individual studies are often inconsistent and underpowered to provide definitive results. Researchers can greatly benefit from an open-source application facilitating the aggregation of evidence of differential coexpression across studies and the estimation of more robust common effects. We developed Meta Gene Set Coexpression Analysis (MetaGSCA), an analytical tool to systematically assess differential coexpression of an a priori defined gene set by aggregating evidence across studies to provide a definitive result. In the kernel, a nonparametric approach that accounts for the gene-gene correlation structure is used to test whether the gene set is differentially coexpressed between two comparative conditions, from which a permutation test p-statistic is computed for each individual study. A meta-analysis is then performed to combine individual study results with one of two options: a random-intercept logistic regression model or the inverse variance method. We demonstrated MetaGSCA in case studies investigating two human diseases and identified pathways highly relevant to each disease across studies. We further applied MetaGSCA in a pan-cancer analysis with hundreds of major cellular pathways in 11 cancer types. The results indicated that a majority of the pathways identified were dysregulated in the pan-cancer scenario, many of which have been previously reported in the cancer literature. Our analysis with randomly generated gene sets showed excellent specificity, indicating that the significant pathways/gene sets identified by MetaGSCA are unlikely false positives. MetaGSCA is a user-friendly tool implemented in both forms of a Web-based application and an R package MetaGSCA. It enables comprehensive meta-analyses of gene set differential coexpression data, with an optional module of post hoc pathway crosstalk network analysis to identify and visualize pathways having similar coexpression profiles. Author summary Analyses of gene set differential coexpression often shed light on molecular mechanisms underlying phenotypes and diseases. However, results from conceptually similar individual studies are often inconsistent and underpowered to reach definitive conclusions. We provide an open-source application facilitating the aggregation of evidence of differential coexpression across studies and the estimation of more robust common effects, with an optional module of post hoc pathway crosstalk network analysis to identify and visualize pathways having similar coexpression profiles. We established the usefulness of MetaGSCA via case studies of chronic kidney disease and non-small cell lung cancer, and applied it to a pan-cancer analysis of 11 cancer types. We further demonstrated the tool with 100 randomly generated gene sets and showed excellent specificity, indicating low false positive rates.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available