☆ 4.6 Article

ON STOCHASTIC AND DETERMINISTIC QUASI-NEWTON METHODS FOR NONSTRONGLY CONVEX OPTIMIZATION: ASYMPTOTIC CONVERGENCE AND RATE ANALYSIS

SIAM JOURNAL ON OPTIMIZATION (2020)

期刊

SIAM JOURNAL ON OPTIMIZATION

卷 30, 期 2, 页码 1144-1172

出版社

SIAM PUBLICATIONS

DOI: 10.1137/17M1152474

关键词

stochastic optimization; quasi-Newton; regularization; large scale optimization

类别

Mathematics, Applied

资金

NSF [CCF-1717391]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Motivated by applications arising from large-scale optimization and machine learning, we consider stochastic quasi-Newton (SQN) methods for solving unconstrained convex optimization problems. Much of the convergence analysis of SQN methods, in both full and limited-memory regimes, requires the objective function to be strongly convex. However, this assumption is fairly restrictive and does not hold in many applications. To the best of our knowledge, no rate statements currently exist for SQN methods in the absence of such an assumption. Furthermore, among the existing first-order methods for addressing stochastic optimization problems with merely convex objectives, techniques equipped with provable convergence rates employ averaging. However, this averaging technique has a detrimental impact on inducing sparsity. Motivated by these gaps, we consider optimization problems with non-strongly convex objectives with Lipschitz but possibly unbounded gradients. The main contributions of the paper are as follows: (i) To address large-scale stochastic optimization problems, we develop an iteratively regularized stochastic limited-memory BFGS (IRS-LBFGS) algorithm, where the step size, regularization parameter, and the Hessian inverse approximation are updated iteratively. We establish convergence of the iterates (with no averaging) to an optimal solution of the original problem both in an almost-sure sense and in a mean sense. The convergence rate is derived in terms of the objective function value and is shown to be O(1/k((1)(/3-epsilon))), where epsilon is an arbitrary small positive scalar. (ii) In deterministic regimes, we show that the algorithm displays a rate O(1/k(1-)(epsilon)). We present numerical experiments performed on a large-scale text classification problem and compare IRS-LBFGS with standard SQN methods as well as first-order methods such as SAGA and IAG.

ON STOCHASTIC AND DETERMINISTIC QUASI-NEWTON METHODS FOR NONSTRONGLY CONVEX OPTIMIZATION: ASYMPTOTIC CONVERGENCE AND RATE ANALYSIS

期刊

SIAM JOURNAL ON OPTIMIZATION

出版社

SIAM PUBLICATIONS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

ON STOCHASTIC AND DETERMINISTIC QUASI-NEWTON METHODS FOR NONSTRONGLY CONVEX OPTIMIZATION: ASYMPTOTIC CONVERGENCE AND RATE ANALYSIS

期刊

SIAM JOURNAL ON OPTIMIZATION

出版社

SIAM PUBLICATIONS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文