Article

The Molecular Signatures Database Hallmark Gene Set Collection

Journal

CELL SYSTEMS
Volume 1, Issue 6, Pages 417-425

Publisher

CELL PRESS
DOI: 10.1016/j.cels.2015.12.004

Funding

  1. NIH [R01CA154480, R01CA121941, R01GM074024, U54CA112962]

The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark" gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined" gene set, derived from multiple "founder" sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.
