4.6 Article

Down-weighting overlapping genes improves gene set analysis

期刊

BMC BIOINFORMATICS
卷 13, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/1471-2105-13-136

关键词

Gene expression; Gene set analysis; Pathway analysis; Overlapping gene sets

资金

  1. Intramural Research Program of the National Institute of Child Health and Human Development, NIH/DHHS
  2. NIH [RO1 RDK089167-01]
  3. NSF [DBI-0965741]
  4. Robert J. Sokol Endowment in Systems Biology
  5. Div Of Biological Infrastructure
  6. Direct For Biological Sciences [0965741] Funding Source: National Science Foundation

向作者/读者索取更多资源

Background: The identification of gene sets that are significantly impacted in a given condition based on microarray data is a crucial step in current life science research. Most gene set analysis methods treat genes equally, regardless how specific they are to a given gene set. Results: In this work we propose a new gene set analysis method that computes a gene set score as the mean of absolute values of weighted moderated gene t-scores. The gene weights are designed to emphasize the genes appearing in few gene sets, versus genes that appear in many gene sets. We demonstrate the usefulness of the method when analyzing gene sets that correspond to the KEGG pathways, and hence we called our method Pathway Analysis with Down-weighting of Overlapping Genes (PADOG). Unlike most gene set analysis methods which are validated through the analysis of 2-3 data sets followed by a human interpretation of the results, the validation employed here uses 24 different data sets and a completely objective assessment scheme that makes minimal assumptions and eliminates the need for possibly biased human assessments of the analysis results. Conclusions: PADOG significantly improves gene set ranking and boosts sensitivity of analysis using information already available in the gene expression profiles and the collection of gene sets to be analyzed. The advantages of PADOG over other existing approaches are shown to be stable to changes in the database of gene sets to be analyzed. PADOG was implemented as an R package available at: http://bioinformaticsprb.med.wayne.edu/PADOG/ or www.bioconductor.org.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据