☆ 4.6 Article

A ZEROTH-ORDER PROXIMAL STOCHASTIC GRADIENT METHOD FOR WEAKLY CONVEX STOCHASTIC OPTIMIZATION

SIAM JOURNAL ON SCIENTIFIC COMPUTING (2023)

期刊

SIAM JOURNAL ON SCIENTIFIC COMPUTING

卷 45, 期 5, 页码 A2679-A2702

出版社

SIAM PUBLICATIONS

DOI: 10.1137/22M1494270

关键词

zeroth-order optimization; weakly convex stochastic optimization; stochastic gradient descent; hyperparameter tuning; composite optimization

类别

Mathematics, Applied

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this paper, we analyze a zeroth-order proximal stochastic gradient method for weakly convex stochastic optimization problems. The proposed algorithm utilizes Gaussian smoothing technique to estimate the zeroth-order gradient of a related partially smooth surrogate problem. This method does not require first-order information and provides state-of-the-art convergence rates.

In this paper we analyze a zeroth-order proximal stochastic gradient method suitable for the minimization of weakly convex stochastic optimization problems. We consider nonsmooth and nonlinear stochastic composite problems, for which (sub)gradient information might be unavailable. The proposed algorithm utilizes the well-known Gaussian smoothing technique, which yields unbiased zeroth-order gradient estimators of a related partially smooth surrogate problem (in which one of the two nonsmooth terms in the original problem's objective is replaced by a smooth approximation). This allows us to employ a standard proximal stochastic gradient scheme for the approximate solution of the surrogate problem, which is determined by a single smoothing parameter, and without the utilization of first-order information. We provide state-of-the-art convergence rates for the proposed zeroth-order method using minimal assumptions. The proposed scheme is numerically compared against alternative zeroth-order methods as well as a stochastic subgradient scheme on a standard phase retrieval problem. Further, we showcase the usefulness and effectiveness of our method in the unique setting of automated hyperparameter tuning. In particular, we focus on automatically tuning the parameters of optimization algorithms by minimizing a novel heuristic model. The proposed approach is tested on a proximal alternating direction method of multipliers for the solution of L-1/L-2-regularized PDE-constrained optimal control problems, with evident empirical success.

A ZEROTH-ORDER PROXIMAL STOCHASTIC GRADIENT METHOD FOR WEAKLY CONVEX STOCHASTIC OPTIMIZATION

期刊

SIAM JOURNAL ON SCIENTIFIC COMPUTING

出版社

SIAM PUBLICATIONS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A ZEROTH-ORDER PROXIMAL STOCHASTIC GRADIENT METHOD FOR WEAKLY CONVEX STOCHASTIC OPTIMIZATION

期刊

SIAM JOURNAL ON SCIENTIFIC COMPUTING

出版社

SIAM PUBLICATIONS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文