4.7 Article

Reinforcement learning based CPG-controlled method with high adaptability and robustness: An experimental study on a robotic fishtail

期刊

OCEAN ENGINEERING
卷 289, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.oceaneng.2023.116259

关键词

Control strategy; Reinforcement learning; Robotic fish; Experimental investigation; Optimal control

向作者/读者索取更多资源

This paper proposes a reinforcement learning based control strategy for autonomous decision-making of robotic fishtails in complex flows. The experimental results confirm the superiority of the control strategy in terms of accuracy, stability, and response speed.
An adaptive and robust control is imperative for robots operating in complex environments. The artificial intelligence approach presents a promising solution, while training and practical application in an actual robot remain a significant challenge. This paper proposes a reinforcement learning based control strategy with lightweight computation, aiming to optimize thrust in complex flows and varying self-structural characteristics of robotic fishtails. Improved Q-learning algorithm and CPG control are utilized to enable the robotic fishtail to make autonomous and accurate decisions in response to unknown changes. The control strategy is trained in actual physical flow fields, with an action selection strategy and a reward system proposed to accelerate convergence and enhance training stability. An integrated system of online learning, response measuring, and real-time monitoring is developed for the training and testing processes. Variable environmental tests are performed under different turbulent flows. Distinct caudal fins are utilized to conduct tests on self-structural variation. The experimental results confirm the robustness and adaptability of the control strategy, as well as its superiority over the PID approach in terms of accuracy, stability, and response speed. The control strategy architecture and physical experimental method could offer experience and application reference for the intelligent control of actual robots.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据