4.7 Article

A path planning strategy unified with a COLREGS collision avoidance function based on deep reinforcement learning and artificial potential field

期刊

APPLIED OCEAN RESEARCH
卷 113, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.apor.2021.102759

关键词

Deep reinforcement learning; Path planning; Artificial potential field; COLREGS collision avoidance

资金

  1. National Natural Science Foundation of China [51809113, 51249006]
  2. Fujian Province Science and Technology Department [2019H0019, 2019H6017]
  3. Fujian Provincial Young TopNotch Talent Plan [Z02101]
  4. Project of Intelligent Situation Awareness System for Smart Ship [MC-201920-X01]
  5. Fujian Provincial Department of Ocean and Fisheries [FJHJF-L-2020-6]

向作者/读者索取更多资源

This study proposes a path planning strategy for collision avoidance of unmanned surface vessels in uncertain environments using deep reinforcement learning, with the artificial potential field algorithm improving the action space and reward function of the algorithm to effectively address collision avoidance issues. Simulation experiments demonstrate the effectiveness of the enhanced deep reinforcement learning in autonomous collision avoidance path planning.
Improving the autopilot capability of ships is particularly important to ensure the safety of maritime navigation. The unmanned surface vessel (USV) with autopilot capability is a development trend of the ship of the future. The objective of this paper is to investigate the path planning problem of USVs in uncertain environments, and a path planning strategy unified with a collision avoidance function based on deep reinforcement learning (DRL) is proposed. A Deep Q-learning network (DQN) is used to continuously interact with the visually simulated environment to obtain experience data, so that the agent learns the best action strategies in the visual simulated environment. To solve the collision avoidance problems that may occur during USV navigation, the location of the obstacle ship is divided into four collision avoidance zones according to the International Regulations for Preventing Collisions at Sea (COLREGS). To obtain an improved DRL algorithm, the artificial potential field (APF) algorithm is utilized to improve the action space and reward function of the DQN algorithm. A simulation experiments is utilized to test the effects of our method in various situations. It is also shown that the enhanced DRL can effectively realize autonomous collision avoidance path planning.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据