4.4 Article

Functional Contour-following via Haptic Perception and Reinforcement Learning

期刊

IEEE TRANSACTIONS ON HAPTICS
卷 11, 期 1, 页码 61-72

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TOH.2017.2753233

关键词

Active touch; contour-following; decision making; haptic perception; manipulation; reinforcement learning

资金

  1. National Science Foundation [1461547, 1463960, 1533983]
  2. Office of Naval Research [N00014-16-1-2468]
  3. Directorate For Engineering [1463960, 1533983] Funding Source: National Science Foundation
  4. Div Of Chem, Bioeng, Env, & Transp Sys [1463960] Funding Source: National Science Foundation
  5. Div Of Industrial Innovation & Partnersh [1533983] Funding Source: National Science Foundation
  6. Div Of Information & Intelligent Systems
  7. Direct For Computer & Info Scie & Enginr [1461547] Funding Source: National Science Foundation

向作者/读者索取更多资源

Many tasks involve the fine manipulation of objects despite limited visual feedback. In such scenarios, tactile and proprioceptive feedback can be leveraged for task completion. We present an approach for real-time haptic perception and decision-making for a haptics-driven, functional contour-following task: the closure of a ziplock bag. This task is challenging for robots because the bag is deformable, transparent, and visually occluded by artificial fingertip sensors that are also compliant. A deep neural net classifier was trained to estimate the state of a zipper within a robot's pinch grasp. A Contextual Multi-Armed Bandit (C-MAB) reinforcement learning algorithm was implemented to maximize cumulative rewards by balancing exploration versus exploitation of the state-action space. The C-MAB learner outperformed a benchmark Q-learner by more efficiently exploring the state-action space while learning a hard-to-code task. The learned C-MAB policy was tested with novel ziplock bag scenarios and contours (wire, rope). Importantly, this work contributes to the development of reinforcement learning approaches that account for limited resources such as hardware life and researcher time. As robots are used to perform complex, physically interactive tasks in unstructured or unmodeled environments, it becomes important to develop methods that enable efficient and effective learning with physical testbeds.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据