Journal
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC)
Volume -, Issue -, Pages 7474-7479Publisher
IEEE
Keywords
Linear quadratic regulator; gradient descent; gradient-flow dynamics; model-free control; nonconvex optimization; Polyak-Lojasiewicz inequality
Funding
- National Science Foundation [ECCS-1809833]
- Air Force Office of Scientific Research [FA9550-16-1-0009]
Ask authors/readers for more resources
In large-scale and model-free settings, first-order algorithms are often used in an attempt to find the optimal control action without identifying the underlying dynamics. The convergence properties of these algorithms remain poorly understood because of nonconvexity. In this paper, we revisit the continuous-time linear quadratic regulator problem and take a step towards demystifying the efficiency of gradient-based strategies. Despite the lack of convexity, we establish a linear rate of convergence to the globally optimal solution for the gradient descent algorithm. The key component of our analysis is that we relate the gradient-flow dynamics associated with the nonconvex formulation to that of a convex reparameterization. This allows us to provide convergence guarantees for the nonconvex approach from its convex counterpart.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available