4.6 Article

APPROXIMATING OPTIMAL FEEDBACK CONTROLLERS OF FINITE HORIZON CONTROL PROBLEMS USING HIERARCHICAL TENSOR FORMATS

期刊

SIAM JOURNAL ON SCIENTIFIC COMPUTING
卷 44, 期 3, 页码 B746-B770

出版社

SIAM PUBLICATIONS
DOI: 10.1137/21M1412190

关键词

feedback control; Bellman equation; tensor train; Pontryagin maximum principle; model predictive control; policy iteration

资金

  1. German Research Foundation (DFG) via the Research Training Group DAEDALUS [GRK 2433]

向作者/读者索取更多资源

This paper discusses the finite horizon control problem in ordinary differential equation systems, and presents two different methods for solving it: policy iteration and model predictive control. For high-dimensional systems, low-rank tensor approximation and high-dimensional quadrature methods are used for numerical solution, and the effectiveness of the methods is verified through examples.
Controlling systems of ordinary differential equations is ubiquitous in science and en-gineering. For finding an optimal feedback controller, the value function and associated fundamental equations such as the Bellman equation and the Hamilton-Jacobi-Bellman equation are essential. The numerical treatment of these equations poses formidable challenges due to their non-linearity and their (possibly) high dimensionality. In this paper we consider a finite horizon control sys-tem with associated Bellman equation. After a time discretization, we obtain a sequence of short time horizon problems which we call local optimal control problems. For solving the local optimal control problems we apply two different methods; one is the well-known policy iteration, where a fixed-point iteration is required for every time step. The other algorithm borrows ideas from model predictive control by solving the local optimal control problem via open-loop control methods on a short time horizon, allowing us to replace the fixed-point iteration by an adjoint method. For high -dimensional systems we apply low-rank hierarchical tensor product approximation/tree-based tensor formats, in particular tensor trains and multipolynomials, together with high-dimensional quadra-ture, e.g., Monte Carlo. We prove a linear error propagation with respect to the time discretization and give numerical evidence by controlling a diffusion equation with an unstable reaction term and an Allen--Cahn equation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据