4.7 Article

Simultaneous policy update algorithms for learning the solution of linear continuous-time H∞ state feedback control

Journal

INFORMATION SCIENCES
Volume 222, Issue -, Pages 472-485

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2012.08.012

Keywords

Simultaneous policy update algorithm; H-infinity state feedback control; Algebra Riccati equation; Lyapunov equation; Offline; Online

Funding

  1. National Basic Research Program of China (973 Program) [2012CB720003]
  2. National Natural Science Foundation of China [61074057, 61121003, 61004023, 91016004]

Ask authors/readers for more resources

It is well known that the H-infinity state feedback control problem can be viewed as a two-player zero-sum game and reduced to find a solution of the algebra Riccati equation (ARE). In this paper, we propose a simultaneous policy update algorithm (SPUA) for solving the ARE, and develop offline and online versions. The offline SPUA is a model-based approach, which obtains the solution of the ARE by solving a sequence of Lyapunov equations (LEs). Its convergence is established rigorously by constructing a Newton's sequence for the fixed point equation. The online SPUA is a partially model-free approach, which takes advantage of the thought of reinforcement learning (RL) to learn the solution of the ARE online without requiring the internal system dynamics, wherein both players update their action policies simultaneously. The convergence of the online SPUA is proved by demonstrating that it is mathematically equivalent to the offline SPUA. Finally, by conducting comparative simulation studies on an F-16 aircraft plant and a power system, the results show that both the offline SPUA and the online SPUA can find the solution of the ARE, and achieve much better convergence than the existing methods. (C) 2012 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available