4.7 Article

H∞ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2015.2441749

Keywords

Bounded L-2-gain; H-infinity tracking controller; reinforcement learning (RL); tracking Hamilton-Jacobi-Isaac (HJI) equation

Funding

  1. National Science Foundation (NSF) [ECCS-1405173, IIS-1208623]
  2. Office of Naval Research, Arlington, VA, USA [N00014-13-1-0562, N000141410718]
  3. U.S. Army Research Office [W911NF-11-D-0001]
  4. NSF [ECCS-1101401, ECCS-1230040]
  5. Directorate For Engineering
  6. Div Of Electrical, Commun & Cyber Sys [1501044] Funding Source: National Science Foundation
  7. Directorate For Engineering
  8. Div Of Electrical, Commun & Cyber Sys [1405173] Funding Source: National Science Foundation

Ask authors/readers for more resources

This paper deals with the design of an H-infinity tracking controller for nonlinear continuous-time systems with completely unknown dynamics. A general bounded L2-gain tracking problem with a discounted performance function is introduced for the H-infinity tracking. A tracking Hamilton-Jacobi-Isaac (HJI) equation is then developed that gives a Nash equilibrium solution to the associated min-max optimization problem. A rigorous analysis of bounded L-2-gain and stability of the control solution obtained by solving the tracking HJI equation is provided. An upper-bound is found for the discount factor to assure local asymptotic stability of the tracking error dynamics. An off-policy reinforcement learning algorithm is used to learn the solution to the tracking HJI equation online without requiring any knowledge of the system dynamics. Convergence of the proposed algorithm to the solution to the tracking HJI equation is shown. Simulation examples are provided to verify the effectiveness of the proposed method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available