4.4 Article

TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES

Journal

ANNALS OF APPLIED STATISTICS
Volume 11, Issue 3, Pages 1810-1831

Publisher

INST MATHEMATICAL STATISTICS
DOI: 10.1214/17-AOAS1062

Keywords

Permutation test; high-dimensional data; covariance matrix; sparse principal component analysis

Funding

  1. Takeda Pharmaceuticals Company Limited
  2. F. Hoffman-La Roche Ltd
  3. NIH [R01MH085542, R01MH093725, P50MH066392, P50MH080405, R01MH097276, RO1-MH-075916, P50M096891, P50MH0840-53S1, R37MH057881, R37MH057881S1, HHSN271201300031C, AG02219, AG05138, MH06692]
  4. Direct For Mathematical & Physical Scien
  5. Division Of Mathematical Sciences [1553884] Funding Source: National Science Foundation

Ask authors/readers for more resources

Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however, statistical methods have been limited by the high-dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue- Driven (sLED) test for comparing two high-dimensional covariance matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates what we assume to be common, namely sparse and weak signals in gene expression data, and it is closely related with sparse principal component analysis. We prove that sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures under many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset obtained from post-mortem brain tissue from Schizophrenia patients and controls, we provide a novel list of genes implicated in Schizophrenia and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects. We also illustrate that sLED can be generalized to compare other gene-gene relationship matrices that are of practical interest, such as the weighted adjacency matrices.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available