☆ 4.7 Article

Reinforcement Learning Behavioral Control for Nonlinear Autonomous System

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2022)

Journal

IEEE-CAA JOURNAL OF AUTOMATICA SINICA

Volume 9, Issue 9, Pages 1561-1573

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/JAS.2022.105797

Keywords

Behavioral control; mission supervisor; nonlinear autonomous system; reinforcement learning

Funding

National Natural Science Foundation of China [61603094]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper proposes a novel two-layer reinforcement learning behavioral control method to reduce dependence on human intelligence by trial-and-error learning. The upper layer uses a reinforcement learning mission supervisor to learn the optimal mission priority and improves the dynamic performance of mission priority adjustment. The lower layer uses a reinforcement learning controller to learn the optimal control policy and reduces the control cost of mission priority adjustment.

Behavior-based autonomous systems rely on human intelligence to resolve multi-mission conflicts by designing mission priority rules and nonlinear controllers. In this work, a novel two-layer reinforcement learning behavioral control (RLBC) method is proposed to reduce such dependence by trial-and-error learning. Specifically, in the upper layer, a reinforcement learning mission supervisor (RLMS) is designed to learn the optimal mission priority. Compared with existing mission supervisors, the RLMS improves the dynamic performance of mission priority adjustment by maximizing cumulative rewards and reducing hardware storage demand when using neural networks. In the lower layer, a reinforcement learning controller (RLC) is designed to learn the optimal control policy. Compared with existing behavioral controllers, the RLC reduces the control cost of mission priority adjustment by balancing control performance and consumption. All error signals are proved to be semi-globally uniformly ultimately bounded (SGUUB). Simulation results show that the number of mission priority adjustment and the control cost are significantly reduced compared to some existing mission supervisors and behavioral controllers, respectively.

Reinforcement Learning Behavioral Control for Nonlinear Autonomous System

Journal

IEEE-CAA JOURNAL OF AUTOMATICA SINICA

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Reinforcement Learning Behavioral Control for Nonlinear Autonomous System

Journal

IEEE-CAA JOURNAL OF AUTOMATICA SINICA

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper