4.5 Article

A compound decision approach to covariance matrix estimation

Journal

BIOMETRICS
Volume 79, Issue 2, Pages 1201-1212

Publisher

WILEY
DOI: 10.1111/biom.13686

Keywords

compound decision theory; g-modeling; nonparametric maximum likelihood; separable decision rule

Ask authors/readers for more resources

This paper studies a new approach to estimating high-dimensional covariance matrices and frames it as a compound decision problem. By using a nonparametric empirical Bayes g-modeling approach, the optimal rule in the class is estimated. Experimental results show that this method can achieve comparable or better performance in gene network inference.
Covariance matrix estimation is a fundamental statistical task in many applications, but the sample covariance matrix is suboptimal when the sample size is comparable to or less than the number of features. Such high-dimensional settings are common in modern genomics, where covariance matrix estimation is frequently employed as a method for inferring gene networks. To achieve estimation accuracy in these settings, existing methods typically either assume that the population covariance matrix has some particular structure, for example, sparsity, or apply shrinkage to better estimate the population eigenvalues. In this paper, we study a new approach to estimating high-dimensional covariance matrices. We first frame covariance matrix estimation as a compound decision problem. This motivates defining a class of decision rules and using a nonparametric empirical Bayes g-modeling approach to estimate the optimal rule in the class. Simulation results and gene network inference in an RNA-seq experiment in mouse show that our approach is comparable to or can outperform a number of state-of-the-art proposals.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available