☆ 4.6 Article

A control-theoretic perspective on optimal high-order optimization

MATHEMATICAL PROGRAMMING (2022)

期刊

MATHEMATICAL PROGRAMMING

卷 195, 期 1-2, 页码 929-975

出版社

SPRINGER HEIDELBERG

DOI: 10.1007/s10107-021-01721-3

关键词

Convex optimization; Optimal acceleration; Closed-loop control system; Feedback control; High-order tensor algorithm; Iteration complexity

类别

Computer Science, Software Engineering Operations Research & Management Science Mathematics, Applied

资金

Mathematical Data Science program of the Office of Naval Research [N00014-18-1-2764]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The study presents an optimal tensor algorithm from a control-theoretic perspective, proving the existence and uniqueness of local and global solutions, analyzing convergence properties, and demonstrating the fundamental role of feedback control in optimal acceleration. The analysis shows that all discussed p-th order optimal tensor algorithms minimize the squared gradient norm at a rate of O(k(-3p)), complementing recent studies in the field.

We provide a control-theoretic perspective on optimal tensor algorithms for minimizing a convex function in a finite-dimensional Euclidean space. Given a function Phi : R-d -> R that is convex and twice continuously differentiable, we study a closed-loop control system that is governed by the operators del Phi and del(2)Phi together with a feedback control law lambda(.) satisfying the algebraic equation lambda(t))(p) parallel to del Phi(x(t))parallel to(p-1) = theta for some theta is an element of (0, 1). Our first contribution is to prove the existence and uniqueness of a local solution to this system via the Banach fixed-point theorem. We present a simple yet nontrivial Lyapunov function that allows us to establish the existence and uniqueness of a global solution under certain regularity conditions and analyze the convergence properties of trajectories. The rate of convergence is O(1/t((3P+1)/2)) in terms of objective function gap and O(1/t(3p)) in terms of squared gradient norm. Our second contribution is to provide two algorithmic frameworks obtained from discretization of our continuous-time system, one of which generalizes the large-step A-HPE framework of Monteiro and Svaiter (SIAM J Optim 23(2):1092-1125, 2013) and the other of which leads to a new optimal p-th order tensor algorithm. While our discrete-time analysis can be seen as a simplification and generalization of Monteiro and Svaiter (2013), it is largely motivated by the aforementioned continuous-time analysis, demonstrating the fundamental role that the feedback control plays in optimal acceleration and the clear advantage that the continuous-time perspective brings to algorithmic design. A highlight of our analysis is that we show that all of the p-th order optimal tensor algorithms that we discuss minimize the squared gradient norm at a rate of O(k(-3p)), which complements the recent analysis in Gasnikov et al. (in: COLT, PMLR, pp 1374-1391, 2019), Jiang et al. (in: COLT, PMLR, pp 1799-1801, 2019) and Bubeck et al. (in: COLT, PMLR, pp 492-507, 2019).

A control-theoretic perspective on optimal high-order optimization

期刊

MATHEMATICAL PROGRAMMING

出版社

SPRINGER HEIDELBERG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A control-theoretic perspective on optimal high-order optimization

期刊

MATHEMATICAL PROGRAMMING

出版社

SPRINGER HEIDELBERG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文