4.7 Article

Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2015.2399020

Keywords

Actor-critic; approximate dynamic programming (ADP); category; optimal control; shunting inhibitory artificial neural network (SIANN)

Funding

  1. National Science Foundation [ECCS-1128050]
  2. Beijing Natural Science Foundation [4143065, 4132078]
  3. National Natural Science Foundation of China [61304079, 61433004, 61374105, 61120106011]
  4. China Post-Doctoral Science Foundation [2013M530527]
  5. Fundamental Research Funds for the Central Universities [FRF-TP-14-119A2]
  6. Open Research Project through the State Key Laboratory of Management and Control for Complex Systems [20120106]
  7. Office of Naval Research, Arlington, VA, USA [N00014-13-1-0562]
  8. Air Force Office of Scientific Research through the European Office of Aerospace Research and Development [13-3055]
  9. U.S. Army Research Office [W911NF-11-D-0001]
  10. Ministry of Education, China, through the 111 Project [B08015]
  11. Direct For Computer & Info Scie & Enginr
  12. Div Of Information & Intelligent Systems [1208623] Funding Source: National Science Foundation
  13. Directorate For Engineering
  14. Div Of Electrical, Commun & Cyber Sys [1405173, 1128050] Funding Source: National Science Foundation
  15. Directorate For Engineering
  16. Div Of Electrical, Commun & Cyber Sys [1101401] Funding Source: National Science Foundation

Ask authors/readers for more resources

In industrial process control, there may be multiple performance objectives, depending on salient features of the input-output data. Aiming at this situation, this paper proposes multiple actor-critic structures to obtain the optimal control via input-output data for unknown nonlinear systems. The shunting inhibitory artificial neural network (SIANN) is used to classify the input-output data into one of several categories. Different performance measure functions may be defined for disparate categories. The approximate dynamic programming algorithm, which contains model module, critic network, and action network, is used to establish the optimal control in each category. A recurrent neural network (RNN) model is used to reconstruct the unknown system dynamics using input-output data. NNs are used to approximate the critic and action networks, respectively. It is proven that the model error and the closed unknown system are uniformly ultimately bounded. Simulation results demonstrate the performance of the proposed optimal control scheme for the unknown nonlinear system.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available