☆ 4.6 Article

Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems

IEEE TRANSACTIONS ON CYBERNETICS (2021)

Journal

IEEE TRANSACTIONS ON CYBERNETICS

Volume 51, Issue 5, Pages 2372-2383

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TCYB.2020.2979614

Keywords

Optimal control; Nonlinear systems; Decentralized control; Mathematical model; Convergence; Multi-agent systems; Adaptive dynamic programming (ADP); approximate dynamic programming; distributed policy iteration; nonlinear systems; optimal control

Funding

National Natural Science Foundation of China [61722312]
National Science Foundation [ECCS 1917275]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

A novel distributed policy iteration algorithm is proposed for infinite horizon optimal control problems of continuous-time nonlinear systems. By improving the control law iteratively one by one, the computational burden in each iteration is effectively reduced. The properties of the distributed policy iteration algorithm, including monotonicity, convergence, and optimality, have been analyzed in detail.

In this article, a novel distributed policy iteration algorithm is established for infinite horizon optimal control problems of continuous-time nonlinear systems. In each iteration of the developed distributed policy iteration algorithm, only one controller's control law is updated and the other controllers' control laws remain unchanged. The main contribution of the present algorithm is to improve the iterative control law one by one, instead of updating all the control laws in each iteration of the traditional policy iteration algorithms, which effectively releases the computational burden in each iteration. The properties of distributed policy iteration algorithm for continuous-time nonlinear systems are analyzed. The admissibility of the present methods has also been analyzed. Monotonicity, convergence, and optimality have been discussed, which show that the iterative value function is nonincreasingly convergent to the solution of the Hamilton-Jacobi-Bellman equation. Finally, numerical simulations are conducted to illustrate the effectiveness of the proposed method.

Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems

Journal

IEEE TRANSACTIONS ON CYBERNETICS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems

Journal

IEEE TRANSACTIONS ON CYBERNETICS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper