Journal
EXPERT SYSTEMS WITH APPLICATIONS
Volume 217, Issue -, Pages -Publisher
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.119530
Keywords
Canonical correlation analysis; Branch-and-bound algorithm; Sparse estimation; Mixed-integer optimization; Statistics
Ask authors/readers for more resources
Canonical correlation analysis (CCA) is a multivariate statistical method for extracting mutual information from multiple datasets. We propose a mixed-integer optimization (MIO) approach to improve the interpretability and efficiency of CCA estimation. Our branch-and-bound algorithm based on the generalized eigenvalue problem can find an optimal solution in terms of canonical correlation, outperforming direct application of optimization software. Moreover, our method provides better-quality solutions than forward stepwise selection and L1-regularized estimation in terms of generalization performance.
Canonical correlation analysis (CCA) is a family of multivariate statistical methods for extracting mutual information contained in multiple datasets. To improve the interpretability of CCA, here we focus on the mixed-integer optimization (MIO) approach to sparse estimation. This approach was first proposed for sparse linear regression in the 1970s, but it has recently received renewed attention due to advances in optimization algorithms and computer hardware. To exactly solve an MIO problem for optimal sparse CCA estimation, we propose a branch-and-bound algorithm based on the generalized eigenvalue problem for computing effective lower and upper bounds. We prove that our algorithm finds a solution with guaranteed optimality in terms of the canonical correlation. Computational results demonstrate that our method is much faster than direct application of optimization software to the MIO problem. Moreover, our method can provide better -quality solutions than can forward stepwise selection and L1-regularized estimation in terms of generalization performance. These results enhance the potential of optimal sparse estimation in multivariate statistical analyses.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available