4.6 Article

A Generalized Least-Square Matrix Decomposition

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
卷 109, 期 505, 页码 145-159

出版社

AMER STATISTICAL ASSOC
DOI: 10.1080/01621459.2013.852978

关键词

Functional PCA; Neuroimaging; Principal components analysis; Singular value decomposition; Sparse PCA; Structured data

资金

  1. NSF [DMS-1209017, DMS-0906801]
  2. NSF IGERT Award [0801700]
  3. Division Of Mathematical Sciences
  4. Direct For Mathematical & Physical Scien [1209017] Funding Source: National Science Foundation

向作者/读者索取更多资源

Variables in many big-data settings are structured, arising, for example, from measurements on a regular grid as in imaging and time series or from spatial-temporal measurements as in climate studies. Classical multivariate techniques ignore these structural relationships often resulting in poor performance. We propose a generalization of principal components analysis (PCA) that is appropriate for massive datasets with structured variables or known two-way dependencies. By finding the best low-rank approximation of the data with respect to a transposable quadratic norm, our decomposition, entitled the generalized least-square matrix decomposition (GMD), directly accounts for structural relationships. As many variables in high-dimensional settings are often irrelevant, we also regularize our matrix decomposition by adding two-way penalties to encourage sparsity or smoothness. We develop fast computational algorithms using our methods to perform generalized PCA (GPCA), sparse GPCA, and functional GPCA on massive datasets. Through simulations and a whole brain functional MRI example, we demonstrate the utility of our methodology for dimension reduction, signal recovery, and feature selection with high-dimensional structured data. Supplementary materials for this article are available online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据