4.7 Article

Safe deep reinforcement learning-based adaptive control for USV interception mission

期刊

OCEAN ENGINEERING
卷 246, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.oceaneng.2021.110477

关键词

Unmanned surface vessels; Safe reinforcement learning; Data-based learning control; Uniformly ultimate bounded stability; Interception mission

资金

  1. Key R&D Program of Guangdong [B1111010002]
  2. National Key R&D Program of China [2017YFE0128500]

向作者/读者索取更多资源

This paper presents a safe learning scheme for the USV interception mission using a safe Lyapunov boundary deep deterministic policy gradient algorithm. By applying a single neuron proportional adaptive control for pre-training, the proposed method quickly converges to feasible solutions subject to safety constraints and demonstrates high performance in stability and safety.
This paper aims to develop a safe learning scheme of the USV interception mission. A safe Lyapunov boundary deep deterministic policy gradient (SLDDPG) algorithm is presented for the USV interception mission. The uniformly ultimate bounded (UUB) stability of control systems is analyzed under finite safety constraints. A single neuron proportional adaptive control (SNPAC) is applied to pre-train the deep policy network for speeding up the training process. The proposed method is evaluated by a series of simulations of the USVs interception and tracking mission. Compared with the existing results, our method can fast converge to the feasible solution subject to safety constraints and demonstrate a high performance in stability and safety by virtual-reality experiments.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据