Journal
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS
Volume 355, Issue 5, Pages 2610-2630
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.jfranklin.2018.02.001
Funding
- National Natural Science Foundation of China [61433004, 61627809, 61621004]
- IAPI Fundamental Research Funds [2013ZCX14]
This paper proposes a novel iterative approximate dynamic programming scheme that introduces the learning mechanism of value iteration (VI) to solve the constrained optimal control problem for continuous-time (CT) affine nonlinear systems using only one neural network. The aim is to show, from a theoretical point of view, that the VI learning mechanism can be applied to the constrained optimal control problem, so that the initial admissible control required by most existing works based on policy iteration (PI) can be avoided. Moreover, the initial condition of the proposed VI-based method is more general than that of the traditional VI method, which requires the initial value function to be the zero function. A general analytical method is proposed to demonstrate the convergence property. To simplify the architecture, only one critic neural network is adopted to approximate the iterative value function when implementing the proposed method. Finally, two simulation examples are presented to validate the theoretical results. (C) 2018 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.
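The VI mechanism described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it replaces the continuous-time system and critic neural network with a tabular value iteration on an Euler-discretized scalar system, and it enforces the input constraint simply by restricting the control grid. The dynamics (dx/dt = -x + u), the quadratic cost, the bound |u| <= 0.5, the grid sizes, and all function and variable names are assumptions chosen for illustration. The sketch only shows two points the abstract makes: no initial admissible control is needed, and the iteration may start from a nonzero value function.

```python
import numpy as np

def value_iteration(v0, xs, us, dt=0.1, tol=1e-10, max_iters=5000):
    """Tabular value iteration on a state grid (illustrative assumption).

    Assumed system: x_next = x + dt * (-x + u), with u restricted to `us`.
    Assumed stage cost: dt * (x^2 + u^2).
    """
    X, U = np.meshgrid(xs, us, indexing="ij")
    x_next = X + dt * (-X + U)          # Euler step for every (state, control) pair
    stage_cost = dt * (X**2 + U**2)
    v = np.asarray(v0, dtype=float).copy()
    residual = np.inf
    for _ in range(max_iters):
        # Evaluate the current value function at successor states (linear interpolation).
        v_next = np.interp(x_next.ravel(), xs, v).reshape(x_next.shape)
        # VI update: minimize over the admissible (bounded) controls only.
        v_new = (stage_cost + v_next).min(axis=1)
        residual = np.max(np.abs(v_new - v))
        v = v_new
        if residual < tol:
            break
    return v, residual

xs = np.linspace(-1.0, 1.0, 101)        # state grid (assumed range)
us = np.linspace(-0.5, 0.5, 21)         # bounded control set |u| <= 0.5 (assumed)

# Start once from the zero function and once from a nonzero initial value function.
v_from_zero, res_zero = value_iteration(np.zeros_like(xs), xs, us)
v_from_quad, res_quad = value_iteration(xs**2, xs, us)
gap = np.max(np.abs(v_from_zero - v_from_quad))
```

In this toy setting both initializations are expected to settle on the same value function, which is the behaviour that the abstract's more general initial condition refers to. The paper's own convergence analysis concerns the continuous-time formulation and is not reproduced here.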