4.6 Article

Adaptive Fault-Tolerant Tracking Control for Discrete-Time Multiagent Systems via Reinforcement Learning Algorithm

期刊

IEEE TRANSACTIONS ON CYBERNETICS
卷 51, 期 3, 页码 1163-1174

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2020.2982168

关键词

Reinforcement learning; Artificial neural networks; Actuators; Fault tolerance; Fault tolerant systems; Estimation; Multi-agent systems; Discrete-time multiagent systems (MASs); fault-tolerant control; neural networks (NNs); reinforcement learning algorithm

资金

  1. Local Innovative and Research Teams Project of Guangdong Special Support Program of 2019
  2. Innovative Research Team Program of Guangdong Province Science Foundation [2018B030312006]
  3. Science and Technology Program of Guangzhou [201904020006]

向作者/读者索取更多资源

This article investigates the adaptive fault-tolerant tracking control problem for a class of discrete-time multiagent systems via a reinforcement learning algorithm. The direct adaptive optimal controllers are designed by combining the backstepping technique with the reinforcement learning algorithm to reduce computational burden, and adaptive auxiliary signals are established to compensate for the influence of dead zones and actuator faults.
This article investigates the adaptive fault-tolerant tracking control problem for a class of discrete-time multiagent systems via a reinforcement learning algorithm. The action neural networks (NNs) are used to approximate unknown and desired control input signals, and the critic NNs are employed to estimate the cost function in the design procedure. Furthermore, the direct adaptive optimal controllers are designed by combining the backstepping technique with the reinforcement learning algorithm. Comparing the existing reinforcement learning algorithm, the computational burden can be effectively reduced by using the method of less learning parameters. The adaptive auxiliary signals are established to compensate for the influence of the dead zones and actuator faults on the control performance. Based on the Lyapunov stability theory, it is proved that all signals of the closed-loop system are semiglobally uniformly ultimately bounded. Finally, some simulation results are presented to illustrate the effectiveness of the proposed approach.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据