4.6 Article

Inference with Transposable Data: Modelling the Effects of Row and Column Correlations

出版社

OXFORD UNIV PRESS
DOI: 10.1111/j.1467-9868.2011.01027.x

关键词

Covariance estimation; False discovery rate; Large-scale inference; Matrix variate normal distribution; Multiple testing; Transposable regularized covariance models

资金

  1. National Science Foundation [DMS-9971405]
  2. National Institutes of Health [N01-HV-28183]

向作者/读者索取更多资源

We consider the problem of large-scale inference on the row or column variables of data in the form of a matrix. Many of these data matrices are transposable meaning that neither the row variables nor the column variables can be considered independent instances. An example of this scenario is detecting significant genes in microarrays when the samples may be dependent because of latent variables or unknown batch effects. By modelling this matrix data by using the matrix variate normal distribution, we study and quantify the effects of row and column correlations on procedures for large-scale inference. We then propose a simple solution to the myriad of problems that are presented by unexpected correlations: we simultaneously estimate row and column covariances and use these to sphere or decorrelate the noise in the underlying data before conducting inference. This procedure yields data with approximately independent rows and columns so that test statistics more closely follow null distributions and multiple-testing procedures correctly control the desired error rates. Results on simulated models and real microarray data demonstrate major advantages of this approach: increased statistical power, less bias in estimating the false discovery rate and reduced variance of the false discovery rate estimators.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据