4.7 Article

Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2019.2954983

关键词

Nonlinear systems; Optimal control; Artificial neural networks; Actuators; Observers; Feedforward systems; Adaptive dynamic programming (ADP); neural networks (NNs); optimal control; reinforcement learning (RL); uncertain input constraints; unknown nonlinear systems

资金

  1. National Natural Science Foundation of China [61973330, 61603387, 61773075, 61533017, U1501251]
  2. Fundamental Research Funds for the Central Universities [2019NTST25]
  3. Early Career Development Award of SKLMCCS [20180201]
  4. State Key Laboratory of Synthetical Automation for Process Industries [2019-KF-23-03]

向作者/读者索取更多资源

This article presents a novel reinforcement learning strategy that addresses an optimal stabilizing problem for unknown nonlinear systems subject to uncertain input constraints. The control algorithm is composed of two parts, i.e., online learning optimal control for the nominal system and feedforward neural networks (NNs) compensation for handling uncertain input constraints, which are considered as the saturation nonlinearities. Integrating the input-output data and recurrent NN, a Luenberger observer is established to approximate the unknown system dynamics. For nominal systems without input constraints, the online learning optimal control policy is derived by solving Hamilton-Jacobi-Bellman equation via a critic NN alone. By transforming the uncertain input constraints to saturation nonlinearities, the uncertain input constraints can be compensated by employing a feedforward NN compensator. The convergence of the closed-loop system is guaranteed to be uniformly ultimately bounded by using the Lyapunov stability analysis. Finally, the effectiveness of the developed stabilization scheme is illustrated by simulation studies.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据