4.7 Article

Robust matrix estimations meet Frank-Wolfe algorithm

Journal

MACHINE LEARNING
Volume 112, Issue 7, Pages 2723-2760

Publisher

SPRINGER
DOI: 10.1007/s10994-023-06325-w

Keywords

Frank-Wolfe algorithms; Huber loss; Matrix-valued parameters; Robust statistical methods; Non-asymptotic properties; Non-smooth criterion function

Ask authors/readers for more resources

We focus on estimating matrix-valued model parameters in large-scale structured data and propose a highly efficient computing scheme using projection-free Frank-Wolfe algorithms. Our methodological framework accommodates robust loss functions and penalty functions in matrix estimation problems. The non-asymptotic error bounds of matrix estimations using Huber loss and nuclear norm penalty are established in two specific cases, demonstrating the advantages of using robust loss functions. Extensive numerical examples and data analysis support the promising performance of our methods.
We consider estimating matrix-valued model parameters with a dedicated focus on their robustness. Our setting concerns large-scale structured data so that a regularization on the matrix's rank becomes indispensable. Though robust loss functions are expected to be effective, their practical implementations are known difficult due to the non-smooth criterion functions encountered in the optimizations. To meet the challenges, we develop a highly efficient computing scheme taking advantage of the projection-free Frank-Wolfe algorithms that require only the first-order derivative of the criterion function. Our methodological framework is broad, extensively accommodating robust loss functions in conjunction with penalty functions in the context of matrix estimation problems. We establish the non-asymptotic error bounds of the matrix estimations with the Huber loss and nuclear norm penalty in two concrete cases: matrix completion with partial and noisy observations and reduced-rank regressions. Our theory demonstrates the merits from using robust loss functions, so that matrix-valued estimators with good properties are achieved even when heavy-tailed distributions are involved. We illustrate the promising performance of our methods with extensive numerical examples and data analysis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available