4.6 Article Proceedings Paper

A randomized exponential canonical correlation analysis method for data analysis and dimensionality reduction

期刊

APPLIED NUMERICAL MATHEMATICS
卷 164, 期 -, 页码 101-124

出版社

ELSEVIER
DOI: 10.1016/j.apnum.2020.09.013

关键词

Canonical correlation analysis (CCA); Matrix exponential; Randomized singular value decomposition (RSVD); Exponential canonical correlation analysis (ECCA); Generalized eigenvalue problem

资金

  1. Fundamental Research Funds for the Central Universities of China [2019XKQYMS89]

向作者/读者索取更多资源

Canonical correlation analysis (CCA) is a widely used data analysis method that aims to extract meaningful information by maximizing the correlation between two sets of variables. However, it may face challenges such as small sample size and overfitting. In this work, an Exponential Canonical Correlation Analysis (ECCA) method based on matrix exponential is proposed, as well as a Randomized Exponential Canonical Correlation Analysis (RECCA) method using randomized singular value decomposition to address these challenges. Experimental results demonstrate the superior performance of these proposed algorithms over existing CCA methods.
Canonical correlation analysis (CCA) is a famous data analysis method that has been successfully used in many areas. CCA extracts meaningful information from a pair of data sets, by seeking pairs of linear combinations from two sets of variables with maximum correlation. Mathematically, CCA resorts to solving a large-scale generalized eigenvalue problem. However, as the dimension of the data sets is much larger than the number of samples, CCA may suffer from the small-sample-size (SSS) problem and the over fitting problem. In order to overcome these difficulties, the regularized technique is often applied, but it is difficult to choose the optimal parameter in advance. In this work, we propose an Exponential Canonical Correlation Analysis (ECCA) method based on matrix exponential, which is parameter-free and can overcome the over-fitting and the SSS problems fundamentally. However, the computational overhead of the ECCA method is very high in practice. Based on the randomized singular value decomposition (RSVD), we then propose a Randomized Exponential Canonical Correlation Analysis (RECCA) method for data analysis and dimensionality reduction. Theoretical results are given to show the rationality of this randomized method, and establish the relationship between RECCA and ECCA. Numerical experiments are performed on some real-world, high-dimensional and large-sample data sets, which illustrate the superiority of the proposed algorithms over many state-of-the-art CCA algorithms. (c) 2020 IMACS. Published by Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据