4.7 Article

A versatile interaction framework for robot programming based on hand gestures and poses

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.rcim.2023.102596

关键词

Robot programming; Human-robot interaction; Hand gesture dataset; Hand gesture recognition; Hand pose estimation

向作者/读者索取更多资源

This paper proposes a framework for programming industrial and collaborative robots using hand gestures and poses. The framework allows operators to control the robot using both End-Effector (EE) and joint movements, and accurately transfer compound shapes to the robot. The hand gestures are classified using a deep neural network, achieving 99% test accuracy. The proposed method outperforms other gesture sets in terms of reaction time and accuracy in indicating instructions and reconstructing compound shapes.
This paper proposes a framework for industrial and collaborative robot programming based on the integration of hand gestures and poses. The framework allows operators to control the robot via both End-Effector (EE) and joint movements and to transfer compound shapes accurately to the robot. Seventeen hand gestures, which cover the position and orientation controls of the robotic EE and other auxiliary operations, are designed according to cognitive psychology. Gestures are classified by a deep neural network, which is pre-trained for two-hand pose estimation and fine-tuned on a custom dataset, achieving a test accuracy of 99%. The index finger's pointing direction and the hand's orientation are extracted via 3D hand pose estimation to indicate the robotic EE's moving direction and orientation, respectively. The number of stretched fingers is detected via two-hand pose estimation to represent decimal digits for selecting robot joints and inputting numbers. Finally, we integrate these three manners seamlessly to form a programming framework. We conducted two interaction experiments. The reaction time of the proposed hand gestures in indicating randomly given instructions is significantly less than that of other gesture sets, such as American Sign Language (ASL). The accuracy of our method in compound shape reconstruction is much better than that of hand movement trajectory-based methods, and the operating time is comparable with that of teach pendants.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据