4.7 Article Proceedings Paper

Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity

期刊

BIOINFORMATICS
卷 27, 期 13, 页码 I288-I294

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btr221

关键词

-

资金

  1. NHGRI NIH HHS [T32-HG002536, HG00521401, T32 HG002536] Funding Source: Medline
  2. NHLBI NIH HHS [1K25HL080079, K25 HL080079] Funding Source: Medline
  3. NIDA NIH HHS [U01-DA024417] Funding Source: Medline
  4. NIEHS NIH HHS [N01-ES-45530] Funding Source: Medline
  5. NIH HHS [NH084698] Funding Source: Medline
  6. Direct For Computer & Info Scie & Enginr
  7. Div Of Information & Intelligent Systems [0916676] Funding Source: National Science Foundation

向作者/读者索取更多资源

Motivation: The analysis of gene coexpression is at the core of many types of genetic analysis. The coexpression between two genes can be calculated by using a traditional Pearson's correlation coefficient. However, unobserved confounding effects may cause inflation of the Pearson's correlation so that uncorrelated genes appear correlated. Many general methods have been suggested, which aim to remove the effects of confounding from gene expression data. However, the residual confounding which is not accounted for by these generic correction procedures has the potential to induce correlation between genes. Therefore, amethod that specifically aims to calculate gene coexpression between gene expression arrays, while accounting for confounding effects, is desirable. Results: In this article, we present a statistical model for calculating gene coexpression called mixed model coexpression (MMC), which models coexpression within a mixed model framework. Confounding effects are expected to be encoded in the matrix representing the correlation between arrays, the inter-sample correlation matrix. By conditioning on the information in the inter-sample correlation matrix, MMC is able to produce gene coexpressions that are not influenced by global confounding effects and thus significantly reduce the number of spurious coexpressions observed. We applied MMC to both human and yeast datasets and show it is better able to effectively prioritize strong coexpressions when compared to a traditional Pearson's correlation and a Pearson's correlation applied to data corrected with surrogate variable analysis (SVA).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据