☆ 4.7 Article

Dynamic Target Tracking of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning

JOURNAL OF MARINE SCIENCE AND ENGINEERING (2022)

期刊

JOURNAL OF MARINE SCIENCE AND ENGINEERING

卷 10, 期 10, 页码 -

出版社

MDPI

DOI: 10.3390/jmse10101406

关键词

autonomous underwater vehicle; dynamic target tracking; deep reinforcement learning; experience replay technique

类别

Engineering, Marine Engineering, Ocean Oceanography

资金

Key Projects of Science and Technology Plan of Zhejiang Province [2019C04018]
Opening Research Fund of National Engineering Laboratory for Test and Experiment Technology of Marine Engineering Equipment [750NEL-2021-02]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This research proposes a model-free reinforcement learning algorithm for target tracking of autonomous underwater vehicles (AUVs). By utilizing the actor-critic framework and experience replay technique, the algorithm achieves quick and stable learning of target tracking tasks in various motion states, improving control performance.

Due to the unknown motion model and the complexity of the environment, the problem of target tracking for autonomous underwater vehicles (AUVs) became one of the major difficulties in model-based controllers. Therefore, the target tracking task of AUV is modeled as a Markov decision process (MDP) with unknown state transition probabilities. Based on actor-critic framework and experience replay technique, a model-free reinforcement learning algorithm is proposed to realize the dynamic target tracking of AUVs. In order to improve the performance of the algorithm, an adaptive experience replay scheme is further proposed. Specifically, the proposed algorithm utilizes the experience replay buffer to store and disrupt the samples, so that the time series samples can be used for training the neural network. Then, the sample priority is arranged according to the temporal difference error, while the adaptive parameters are introduced in the sample priority calculation, thus improving the experience replay rules. The results confirm the quick and stable learning of the proposed algorithm, when tracking the dynamic targets in various motion states. Additionally, the results also demonstrate good control performance regarding both stability and computational complexity, thus indicating the effectiveness of the proposed algorithm in target tracking tasks.

Dynamic Target Tracking of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning

期刊

JOURNAL OF MARINE SCIENCE AND ENGINEERING

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Dynamic Target Tracking of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning

期刊

JOURNAL OF MARINE SCIENCE AND ENGINEERING

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文