4.7 Article

Branch-and-bound algorithm for optimal sparse canonical correlation analysis

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 217, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.119530

Keywords

Canonical correlation analysis; Branch-and-bound algorithm; Sparse estimation; Mixed-integer optimization; Statistics

Ask authors/readers for more resources

Canonical correlation analysis (CCA) is a multivariate statistical method for extracting mutual information from multiple datasets. We propose a mixed-integer optimization (MIO) approach to improve the interpretability and efficiency of CCA estimation. Our branch-and-bound algorithm based on the generalized eigenvalue problem can find an optimal solution in terms of canonical correlation, outperforming direct application of optimization software. Moreover, our method provides better-quality solutions than forward stepwise selection and L1-regularized estimation in terms of generalization performance.
Canonical correlation analysis (CCA) is a family of multivariate statistical methods for extracting mutual information contained in multiple datasets. To improve the interpretability of CCA, here we focus on the mixed-integer optimization (MIO) approach to sparse estimation. This approach was first proposed for sparse linear regression in the 1970s, but it has recently received renewed attention due to advances in optimization algorithms and computer hardware. To exactly solve an MIO problem for optimal sparse CCA estimation, we propose a branch-and-bound algorithm based on the generalized eigenvalue problem for computing effective lower and upper bounds. We prove that our algorithm finds a solution with guaranteed optimality in terms of the canonical correlation. Computational results demonstrate that our method is much faster than direct application of optimization software to the MIO problem. Moreover, our method can provide better -quality solutions than can forward stepwise selection and L1-regularized estimation in terms of generalization performance. These results enhance the potential of optimal sparse estimation in multivariate statistical analyses.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available