4.6 Article

A control-theoretic perspective on optimal high-order optimization

期刊

MATHEMATICAL PROGRAMMING
卷 195, 期 1-2, 页码 929-975

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s10107-021-01721-3

关键词

Convex optimization; Optimal acceleration; Closed-loop control system; Feedback control; High-order tensor algorithm; Iteration complexity

资金

  1. Mathematical Data Science program of the Office of Naval Research [N00014-18-1-2764]

向作者/读者索取更多资源

The study presents an optimal tensor algorithm from a control-theoretic perspective, proving the existence and uniqueness of local and global solutions, analyzing convergence properties, and demonstrating the fundamental role of feedback control in optimal acceleration. The analysis shows that all discussed p-th order optimal tensor algorithms minimize the squared gradient norm at a rate of O(k(-3p)), complementing recent studies in the field.
We provide a control-theoretic perspective on optimal tensor algorithms for minimizing a convex function in a finite-dimensional Euclidean space. Given a function Phi : R-d -> R that is convex and twice continuously differentiable, we study a closed-loop control system that is governed by the operators del Phi and del(2)Phi together with a feedback control law lambda(.) satisfying the algebraic equation lambda(t))(p) parallel to del Phi(x(t))parallel to(p-1) = theta for some theta is an element of (0, 1). Our first contribution is to prove the existence and uniqueness of a local solution to this system via the Banach fixed-point theorem. We present a simple yet nontrivial Lyapunov function that allows us to establish the existence and uniqueness of a global solution under certain regularity conditions and analyze the convergence properties of trajectories. The rate of convergence is O(1/t((3P+1)/2)) in terms of objective function gap and O(1/t(3p)) in terms of squared gradient norm. Our second contribution is to provide two algorithmic frameworks obtained from discretization of our continuous-time system, one of which generalizes the large-step A-HPE framework of Monteiro and Svaiter (SIAM J Optim 23(2):1092-1125, 2013) and the other of which leads to a new optimal p-th order tensor algorithm. While our discrete-time analysis can be seen as a simplification and generalization of Monteiro and Svaiter (2013), it is largely motivated by the aforementioned continuous-time analysis, demonstrating the fundamental role that the feedback control plays in optimal acceleration and the clear advantage that the continuous-time perspective brings to algorithmic design. A highlight of our analysis is that we show that all of the p-th order optimal tensor algorithms that we discuss minimize the squared gradient norm at a rate of O(k(-3p)), which complements the recent analysis in Gasnikov et al. (in: COLT, PMLR, pp 1374-1391, 2019), Jiang et al. (in: COLT, PMLR, pp 1799-1801, 2019) and Bubeck et al. (in: COLT, PMLR, pp 492-507, 2019).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据