4.7 Article

Model-based reinforcement learning for approximate optimal regulation

Journal

AUTOMATICA
Volume 64, Issue -, Pages 94-104

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.automatica.2015.10.039

Keywords

Model-based reinforcement learning; Concurrent learning; Simulated experience; Data-based control; Adaptive control; System identification

Funding

  1. National Science Foundation [1509516]
  2. Office of Naval Research [N00014-13-1-0151]
  3. Directorate For Engineering
  4. Div Of Electrical, Commun & Cyber Sys [1509516] Funding Source: National Science Foundation
  5. Div Of Civil, Mechanical, & Manufact Inn
  6. Directorate For Engineering [1161260] Funding Source: National Science Foundation

Ask authors/readers for more resources

Reinforcement learning (RL)-based online approximate optimal control methods applied to deterministic systems typically require a restrictive persistence of excitation (PE) condition for convergence. This paper develops a concurrent learning (CL)-based implementation of model-based RL to solve approximate optimal regulation problems online under a PE-like rank condition. The development is based on the observation that, given a model of the system, RL can be implemented by evaluating the Bellman error at any number of desired points in the state space. In this result, a parametric system model is considered, and a CL-based parameter identifier is developed to compensate for uncertainty in the parameters. Uniformly ultimately bounded regulation of the system states to a neighborhood of the origin, and convergence of the developed policy to a neighborhood of the optimal policy are established using a Lyapunov-based analysis, and simulation results are presented to demonstrate the performance of the developed controller. (C) 2015 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available