4.6 Article

Comparison of model-free and model-based methods for time optimal hit control of a badminton robot

Journal

MECHATRONICS
Volume 24, Issue 8, Pages 1021-1030

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.mechatronics.2014.08.001

Keywords

Robot control; Time optimal motion; Optimization; Reinforcement learning; Natural actor-critic

Funding

  1. Institute for the Promotion of Innovation through Science and Technology in Flanders (IWT-Vlaanderen) [IWT-SBO 80032]

Abstract

In this research, time optimal control is considered for the hit motion of a badminton robot during a serve operation. Even though the robot always starts at rest in a given position, it has to move to a target position where the target velocity is not zero, as the robot has to hit the shuttle at that point. The goal is to reach this target state as quickly as possible, yet without violating the limitations of the actuator. To find controllers satisfying these requirements, both model-based and model-free controllers have been developed, with the model-free controllers employing a Natural Actor-Critic (NAC) reinforcement learning algorithm. The model-based controllers can immediately achieve the desired motions relying on prior model information, while the model-free methods are shown to yield the desired robot motions after about 200 trials. However, in order to achieve this result, a good choice of the reward function is essential. To illustrate this choice and validate the resulting controller, a simulation study is presented in which the model-based results are compared to those obtained with two different reward functions. © 2014 Elsevier Ltd. All rights reserved.
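
The control problem described in the abstract (start at rest, arrive at a target position with a nonzero target velocity, as fast as possible, within actuator limits) can be illustrated with a minimal sketch. The sketch below assumes a one-dimensional double integrator with a symmetric acceleration bound, which is a textbook stand-in and not the robot model used in the paper. The function names `bang_bang_plan` and `final_state` and the parameters `x_f`, `v_f`, and `a_max` are hypothetical and chosen only for this illustration.

```python
import math


def bang_bang_plan(x_f, v_f, a_max):
    """Two-phase bang-bang plan from rest at x = 0 to position x_f with velocity v_f.

    Assumes a double integrator (x'' = u, |u| <= a_max) and v_f >= 0.
    Returns a list of (acceleration, duration) pairs.
    """
    if a_max <= 0 or v_f < 0:
        raise ValueError("this sketch assumes a_max > 0 and v_f >= 0")

    # Distance covered when accelerating straight from rest to v_f.
    d_direct = v_f**2 / (2 * a_max)

    if x_f >= d_direct:
        # Accelerate past the target speed, then brake down to v_f at x_f.
        v_peak = math.sqrt(a_max * x_f + v_f**2 / 2)
        return [(a_max, v_peak / a_max), (-a_max, (v_peak - v_f) / a_max)]

    # Target is too close: back up first to gain run-up distance,
    # then accelerate forward through x_f at velocity v_f.
    w = math.sqrt(v_f**2 / 2 - a_max * x_f)
    return [(-a_max, w / a_max), (a_max, (w + v_f) / a_max)]


def final_state(plan):
    """Integrate the piecewise-constant plan in closed form, starting at rest."""
    x, v = 0.0, 0.0
    for u, dt in plan:
        x += v * dt + 0.5 * u * dt**2
        v += u * dt
    return x, v


if __name__ == "__main__":
    plan = bang_bang_plan(x_f=3.0, v_f=2.0, a_max=2.0)
    print("plan:", plan)
    print("total time:", sum(dt for _, dt in plan))
    print("final state:", final_state(plan))
```

For this simplified model the fastest motion always uses the full acceleration bound with at most one switch, which is why the plan has exactly two phases. A model-free method such as the NAC algorithm named in the abstract would instead have to discover a comparable motion from trials, guided by its reward function.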
