4.7 Article

Robust Optimal Control for Disturbed Nonlinear Zero-Sum Differential Games Based on Single NN and Least Squares

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSMC.2019.2897379

关键词

Game theory; Artificial neural networks; Games; Optimal control; Heuristic algorithms; Nonlinear systems; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming (ADP); policy iteration (PI); robust control; zero-sum game (ZSG)

资金

  1. National Natural Science Foundation of China [61873300, 61722312]
  2. Fundamental Research Funds for the Central Universities [FRF-GF-17-B45]

向作者/读者索取更多资源

This paper establishes an approximate optimal critic learning algorithm based on single neural network (NN) policy iteration (PI) aiming at solving for continuous-time (CT) 2-player zero-sum games (ZSGs). In fact, we have to face the problem that the errors will disturb the dynamics and in turn identifying dynamics will generate errors. In order to prevent the effect of errors, in this paper, a single NN-based online PI algorithm is developed for the CT system, which is disturbed nonlinear ZSG. With plenty of online data, the Hamilton-Jacobi-Isaacs equation can be solved without complete dynamics. Then by the least-squares method, we can obtain the NN weights. Moreover, in the process of dealing with the undisturbed system, we find the way that obtains NN weights in this paper is equal to the way that obtains the optimal solution by the Gauss-Newton method. Based on the convergence of the Gauss-Newton method, we can efficiently obtain the optimal controller for the undisturbed system by utilizing online data. After getting the controller of the undisturbed system, it is time to take disturbance into consideration, so that we design a robust control pair to overcome the disturbance. In order to demonstrate the effectiveness of this algorithm, we design a set of simulations. The results verify that we can solve the disturbed nonlinear ZSG by this algorithm.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据