4.7 Article

Learning to Play Table Tennis From Scratch Using Muscular Robots

Journal

IEEE TRANSACTIONS ON ROBOTICS
Volume 38, Issue 6, Pages 3850-3860

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TRO.2022.3176207

Keywords

Dynamic task; pneumatic muscles; real world robotics; reinforcement learning (RL); robot table tennis; sim-to-real

Funding

  1. Max Planck Institute for Intelligent Systems, Tübingen (Germany)

This article demonstrates the safe end-to-end learning of table tennis using robot arms driven by pneumatic artificial muscles (PAMs) through model-free reinforcement learning (RL). It also introduces a practical procedure called hybrid sim and real training (HYSR) to train the robot without using real balls.
Dynamic tasks such as table tennis are relatively easy for humans to learn but pose significant challenges to robots. Such tasks require accurate control of fast movements and precise timing in the presence of imprecise state estimation of the flying ball and the robot. Reinforcement learning (RL) has shown promise in learning complex control tasks from data. However, applying step-based RL to dynamic tasks on real systems is safety-critical, as RL requires exploring and failing safely for millions of time steps in high-speed and high-acceleration regimes. This article demonstrates that using robot arms driven by pneumatic artificial muscles (PAMs) enables safe end-to-end learning of table tennis using model-free RL. In particular, we learn from scratch for thousands of trials while a stochastic policy acts on the low-level controls of the real system. The robot returns and smashes real balls to a desired landing point at average speeds of 5 m/s and 12 m/s, respectively. Additionally, we present hybrid sim and real training (HYSR), a practical procedure that avoids training with real balls by virtually replaying recorded ball trajectories and applying actions to the real robot. To the best of the authors' knowledge, this work pioneers (i) failsafe learning of a safety-critical dynamic task using anthropomorphic robot arms, (ii) learning a precision-demanding problem with a PAM-driven system that is inherently hard to control, and (iii) training a robot to play table tennis without real balls.
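
The HYSR idea described in the abstract (replaying recorded ball trajectories while actions go to the real robot) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the article: the `StubRobot` class stands in for the real PAM-driven arm, `toward_ball_policy` is a hand-written placeholder for a learned stochastic policy, the trajectory is invented, and the closest racket-to-ball distance is only one plausible quantity a reward could be built on.

```python
import math


class StubRobot:
    """Hypothetical stand-in for the real PAM-driven arm."""

    def __init__(self):
        self.racket = [0.0, 0.0, 0.0]

    def apply(self, action):
        # On hardware this would command the muscles; in this stub the
        # racket simply moves by the commanded displacement.
        self.racket = [r + a for r, a in zip(self.racket, action)]
        return list(self.racket)


def hysr_episode(robot, policy, recorded_ball):
    """Replay one recorded ball trajectory against the robot.

    Returns the smallest racket-to-ball distance seen in the episode.
    """
    racket = list(robot.racket)
    min_dist = math.inf
    for ball in recorded_ball:         # virtual ball: replayed from a recording
        action = policy(racket, ball)  # policy sees robot state and virtual ball
        racket = robot.apply(action)   # the action is executed by the robot
        min_dist = min(min_dist, math.dist(racket, ball))
    return min_dist


def toward_ball_policy(racket, ball, gain=0.5):
    """Toy placeholder policy: step a fraction of the way toward the ball."""
    return [gain * (b - r) for r, b in zip(racket, ball)]


# Invented trajectory: the ball approaches along x at a fixed height.
recorded_ball = [(2.0 - 0.2 * t, 0.0, 0.3) for t in range(10)]
closest = hysr_episode(StubRobot(), toward_ball_policy, recorded_ball)
```

In this sketch no physical ball is ever needed: the ball exists only as replayed data, while the robot side of the loop could in principle be swapped from the stub to real hardware without changing `hysr_episode`.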
