4.6 Article

Optimal control of nonlinear discrete time-varying systems using a new neural network approximation structure

Journal

NEUROCOMPUTING
Volume 156, Issue -, Pages 157-165

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2014.12.067

Keywords

Multiple-model; Optimal control; Adaptive self-organizing map; Reinforcement learning; Value function approximation

Funding

  1. NSF [ECCS-1405173, IIS-1208623]
  2. ONR [N00014-13-1-0562, N000141410718]
  3. ARO [W911NF-11-D-0001]
  4. China NNSF grant [61120106011]
  5. China Education Ministry Project 111 [B08015]
  6. Div Of Electrical, Commun & Cyber Sys
  7. Directorate For Engineering [1405173, 1128050] Funding Source: National Science Foundation
  8. Div Of Information & Intelligent Systems
  9. Direct For Computer & Info Scie & Enginr [1208623] Funding Source: National Science Foundation

Ask authors/readers for more resources

In this paper motivated by recently discovered neurocognitive models of mechanisms in the brain, a new reinforcement learning (RL) method is presented based on a novel critic neural network (NN) structure to solve the optimal tracking problem of a nonlinear discrete time-varying system in an online manner. A multiple-model approach combined with an adaptive self-organizing map (ASOM) neural network is used to detect changes in the dynamics of the system. The number of sub-models is determined adaptively and grows once a mismatch between the stored sub-models and the new data is detected. By using the ASOM neural network, a novel value function approximation (VFA) scheme is presented. Each sub-model contributes into the value function based on a responsibility signal obtained by the ASOM. The responsibility signal determines how much each sub-model contributes to the general value function. Novel policy iteration and the value iteration algorithms are presented to find the optimal control for the partially-unknown nonlinear discrete time-varying systems in an online manner. Simulation results demonstrate the effectiveness of the proposed control scheme. (C) 2014 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available