期刊
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
卷 50, 期 11, 页码 4123-4134出版社
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSMC.2018.2883801
关键词
Adaptive dynamic programming (ADP); model-free; multiagent systems (MASs); optimal consensus control; Q-function
资金
- National Natural Science Foundation of China [61473316]
- Hubei Provincial Natural Science Foundation of China [2017CFA030, 2015CFA010]
- 111 Project [B17040]
Conventionally, as the system's dynamics is known, the optimal consensus control problem relies on solving the coupled Hamilton-Jacobi-Bellman (HJB) equations. In this paper, with the unknown system dynamics being considered, a local Q-function-based adaptive dynamic programming method is put forward to deal with the optimal consensus control problem for unknown discrete-time nonlinear multiagent systems by approximating the solutions of the coupled HJB equations. First, a local Q-function is defined, which considers the local consensus error and the actions of the agent and its neighbors. Using the Q-function, it is convenient to get the derivatives with regard to the weights of the consensus control policies, even without the model of system dynamics. Then, with the defined local Q-function, a distributed policy iteration technique is developed, which is theoretically proved to be convergent to the solutions of the coupled HJB equations. An actor-critic neural network framework for implementing the developed model-free optimal consensus control method is constructed to approximate the local Q-functions and the control policies. Finally, the feasibility and effectiveness of the developed method are verified by a series of simulations.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据