4.6 Article

Optimal control of nonlinear discrete time-varying systems using a new neural network approximation structure

期刊

NEUROCOMPUTING
卷 156, 期 -, 页码 157-165

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2014.12.067

关键词

Multiple-model; Optimal control; Adaptive self-organizing map; Reinforcement learning; Value function approximation

资金

  1. NSF [ECCS-1405173, IIS-1208623]
  2. ONR [N00014-13-1-0562, N000141410718]
  3. ARO [W911NF-11-D-0001]
  4. China NNSF grant [61120106011]
  5. China Education Ministry Project 111 [B08015]
  6. Div Of Electrical, Commun & Cyber Sys
  7. Directorate For Engineering [1405173, 1128050] Funding Source: National Science Foundation
  8. Div Of Information & Intelligent Systems
  9. Direct For Computer & Info Scie & Enginr [1208623] Funding Source: National Science Foundation

向作者/读者索取更多资源

In this paper motivated by recently discovered neurocognitive models of mechanisms in the brain, a new reinforcement learning (RL) method is presented based on a novel critic neural network (NN) structure to solve the optimal tracking problem of a nonlinear discrete time-varying system in an online manner. A multiple-model approach combined with an adaptive self-organizing map (ASOM) neural network is used to detect changes in the dynamics of the system. The number of sub-models is determined adaptively and grows once a mismatch between the stored sub-models and the new data is detected. By using the ASOM neural network, a novel value function approximation (VFA) scheme is presented. Each sub-model contributes into the value function based on a responsibility signal obtained by the ASOM. The responsibility signal determines how much each sub-model contributes to the general value function. Novel policy iteration and the value iteration algorithms are presented to find the optimal control for the partially-unknown nonlinear discrete time-varying systems in an online manner. Simulation results demonstrate the effectiveness of the proposed control scheme. (C) 2014 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据