4.6 Article

Optimal consensus control for multi-agent systems: Multi-step policy gradient adaptive dynamic programming method

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Artificial Intelligence

Model-Free λ-Policy Iteration for Discrete-Time Linear Quadratic Regulation

Yongliang Yang et al.

Summary: This article presents a model-free lambda-policy iteration algorithm for the discrete-time linear quadratic regulation problem. It introduces the weighted Bellman operator and the composite Bellman operator to solve the algebraic Riccati equation. Compared to the PI algorithm, the lambda-PI algorithm does not require an initial policy and has a faster convergence rate. The model-free extension of the lambda-PI algorithm is developed using the off-policy reinforcement learning technique.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Adaptive Fuzzy Leader-Follower Synchronization of Constrained Heterogeneous Multiagent Systems

Yongliang Yang et al.

Summary: This article investigates the distributed adaptive neuro-fuzzy output feedback control protocol design for the output synchronization problem in heterogeneous multiagent systems with nonlinear strict-feedback agent dynamics. It proposes a distributed high-gain observer to estimate the unmeasured agent state and relax the Lipschitz continuity requirement. An asymmetric barrier Lyapunov function is presented to handle transient and steady-state constraints on the output synchronization error. The use of dynamic surface control technique, fuzzy logic system, and a distributed auxiliary dynamical system contributes to the overall control performance.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2022)

Article Automation & Control Systems

Policy Gradient Adaptive Critic Designs for Model-Free Optimal Tracking Control With Experience Replay

Mingduo Lin et al.

Summary: A model-free optimal tracking controller for discrete-time nonlinear systems is designed using policy gradient adaptive critic designs and experience replay, aiming to improve control performance.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Optimal Tracking Control of Nonlinear Multiagent Systems Using Internal Reinforce Q-Learning

Zhinan Peng et al.

Summary: This article introduces a novel reinforcement learning method IrQ-L to solve the optimal tracking control problem of unknown nonlinear multiagent systems, utilizing internal reinforcement reward and Q functions to achieve optimal distributed control schemes, along with establishing a distributed online learning framework.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning

Xindi Yang et al.

Summary: This article introduces a data-based distributed control algorithm to address the consensus control problem in multiagent systems, successfully overcoming the challenges of asynchronous learning. By incorporating an actor-critic structure and neural networks, the algorithm achieves convergence and optimality in both synchronous and asynchronous cases.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Automation & Control Systems

Visuomotor Reinforcement Learning for Multirobot Cooperative Navigation

Zhe Liu et al.

Summary: This article presents an end-to-end learning framework using graph neural networks and deep reinforcement learning to address the multirobot cooperative navigation problem. Experimental results demonstrate its superior performance, generalizability to unseen environments, and scalability to a larger number of training robots.

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING (2022)

Article Computer Science, Artificial Intelligence

Reinforcement Learning-Based Cooperative Optimal Output Regulation via Distributed Adaptive Internal Model

Weinan Gao et al.

Summary: This article introduces a novel data-driven distributed control method for solving the cooperative optimal output regulation problem of leader-follower multiagent systems. The method incorporates a distributed adaptive internal model and reinforcement learning algorithms to learn optimal controller and estimate leader's state values online. By combining these methods, a foundation for connecting data-distributed control methods with adaptive dynamic programming approaches is established.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Automation & Control Systems

Adaptive Dynamic Programming for Control: A Survey and Recent Advances

Derong Liu et al.

Summary: This article reviews the recent development of adaptive dynamic programming (ADP) with applications in control, highlighting efficient algorithms and future research directions. ADP is applied in optimization, game theory, and large-scale systems, showing great potential in the era of artificial intelligence.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

A Parallel Framework of Adaptive Dynamic Programming Algorithm With Off-Policy Learning

Changyin Sun et al.

Summary: This article proposes a model-free online adaptive dynamic programming approach for solving optimal control problems of nonaffine nonlinear systems. The method combines off-policy learning mechanism with parallel paradigm, utilizing multithread agents to collect data and improve exploration capability. By using policy gradient method for policy improvement and actor-critic structure with neural networks for implementation, the algorithm demonstrates effectiveness through numerical examples.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021)

Proceedings Paper Computer Science, Artificial Intelligence

The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning

Lingheng Meng et al.

Summary: Multi-step methods in reinforcement learning have been shown to improve performance, but the specific contributions to performance enhancement are still not well understood. This study analyzes the effect of multi-step methods on alleviating the overestimation problem in deep reinforcement learning, proposing two improved methods, MDDPG and MMDDPG, based on Deep Deterministic Policy Gradient.

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2021)

Article Engineering, Multidisciplinary

Resource Allocation for Delay-Sensitive Vehicle-to-Multi-Edges (V2Es) Communications in Vehicular Networks: A Multi-Agent Deep Reinforcement Learning Approach

Jing Wu et al.

Summary: The V2Es communication framework in vehicular networks improves service quality by utilizing edge node resources and reduces service latency effectively through reinforcement learning method learning dynamic communication states.

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING (2021)

Article Engineering, Electrical & Electronic

Distributed Optimal Consensus Control Algorithm for Continuous-Time Multi-Agent Systems

Qishao Wang et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS (2020)

Article Automation & Control Systems

Optimal Output Regulation of Linear Discrete-Time Systems With Unknown Dynamics Using Reinforcement Learning

Yi Jiang et al.

IEEE TRANSACTIONS ON CYBERNETICS (2020)

Article Engineering, Multidisciplinary

Consensus and H∞ Consensus of Nonlinear Second-Order Multi-Agent Systems

Qing Wang et al.

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING (2020)

Article Engineering, Electrical & Electronic

A Data-Driven Multi-Agent Autonomous Voltage Control Framework Using Deep Reinforcement Learning

Shengyi Wang et al.

IEEE TRANSACTIONS ON POWER SYSTEMS (2020)

Article Automation & Control Systems

Balancing Value Iteration and Policy Iteration for Discrete-Time Control

Biao Luo et al.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2020)

Article Automation & Control Systems

Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation

Biao Luo et al.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2019)

Article Computer Science, Information Systems

Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm

Zhinan Peng et al.

INFORMATION SCIENCES (2019)

Article Computer Science, Artificial Intelligence

Adaptive Optimal Output Regulation of Time-Delay Systems via Measurement Feedback

Weinan Gao et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2019)

Article Automation & Control Systems

Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method

Huaguang Zhang et al.

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS (2017)

Article Automation & Control Systems

Distributed algorithm for dynamic economic power dispatch with energy storage in smart grids

Hao Xing et al.

IET CONTROL THEORY AND APPLICATIONS (2017)

Article Automation & Control Systems

Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control

Biao Luo et al.

IEEE TRANSACTIONS ON CYBERNETICS (2017)

Article Engineering, Electrical & Electronic

A Distributed Multi-Agent Based Emergency Control Approach Following Catastrophic Disturbances in Interconnected Power Systems

Sk. Razibul Islam et al.

IEEE TRANSACTIONS ON POWER SYSTEMS (2016)

Article Automation & Control Systems

Complete stability analysis of a heuristic approximate dynamic programming control design

Yury Sokolov et al.

AUTOMATICA (2015)

Article Automation & Control Systems

Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems

Derong Liu et al.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2015)

Article Automation & Control Systems

Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework

Mohamed A. Khamis et al.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2014)

Article Automation & Control Systems

Cooperative Optimal Control for Multi-Agent Systems on Directed Graph Topologies

Kristian Hengster-Movric et al.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2014)

Article Computer Science, Artificial Intelligence

Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems

Derong Liu et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2014)

Article Automation & Control Systems

Optimal consensus control of linear multi-agent systems with communication time delay

Jie Sheng et al.

IET CONTROL THEORY AND APPLICATIONS (2013)

Article Automation & Control Systems

Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations

Kyriakos G. Vamvoudakis et al.

AUTOMATICA (2011)

Article Automation & Control Systems

Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof

Asma Al-Tamimi et al.

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS (2008)

Review Computer Science, Artificial Intelligence

A comprehensive survey of multiagent reinforcement learning

Lucian Busoniu et al.

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS (2008)