4.4 Article

Dimension-wise sparse low-rank approximation of a matrix with application to variable selection in high-dimensional integrative analyzes of association

期刊

JOURNAL OF APPLIED STATISTICS
卷 49, 期 15, 页码 3889-3907

出版社

TAYLOR & FRANCIS LTD
DOI: 10.1080/02664763.2021.1967892

关键词

High dimension low sample size; multimodal data; nuclear norm; sparse canonical correlation analysis

向作者/读者索取更多资源

A proposed method aims to characterize the dominant modes of co-variation between variables in two datasets while performing variable selection accurately. The method relies on a sparse, low rank approximation of a matrix containing pairwise association measures between variables from the two sets, closely related to sparse canonical correlation analysis methods. Through simulations, it is shown that the proposed method outperforms state-of-the-art sparse CCA algorithms in terms of variable selection accuracies.
Many research proposals involve collecting multiple sources of information from a set of common samples, with the goal of performing an integrative analysis describing the associations between sources. We propose a method that characterizes the dominant modes of co-variation between the variables in two datasets while simultaneously performing variable selection. Our method relies on a sparse, low rank approximation of a matrix containing pairwise measures of association between the two sets of variables. We show that the proposed method shares a close connection with another group of methods for integrative data analysis - sparse canonical correlation analysis (CCA). Under some assumptions, the proposed method and sparse CCA aim to select the same subsets of variables. We show through simulation that the proposed method can achieve better variable selection accuracies than two state-of-the-art sparse CCA algorithms. Empirically, we demonstrate through the analysis of DNA methylation and gene expression data that the proposed method selects variables that have as high or higher canonical correlation than the variables selected by sparse CCA methods, which is a rather surprising finding given that objective function of the proposed method does not actually maximize the canonical correlation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据