Proceedings Paper

VP-GO: A 'Light' Action-Conditioned Visual Prediction Model for Grasping Objects

Publisher

IEEE
DOI: 10.1109/ICARM54641.2022.9959321

Keywords

-

Funding

  1. CHIST-ERA project HEAP
  2. Creativ'Lab in Loria

Abstract

In this paper, VP-GO, a light stochastic action-conditioned visual prediction model, is proposed for robotic grasping of unknown soft objects. By decomposing semantic actions into elementary movements, compatibility with existing models and datasets is ensured. A new open dataset called PandaGrasp is also provided for visual prediction of object grasping.
Visual prediction models are promising solutions for visual-based robotic grasping of cluttered, unknown soft objects. Previous models from the literature are computationally greedy, which limits reproducibility; although some account for stochasticity in the prediction model, it is often too weak to capture the reality of robotic experiments involving the grasping of such objects. Furthermore, previous work focused on elementary movements, which makes it inefficient to reason in terms of more complex semantic actions. To address these limitations, we propose VP-GO, a light stochastic action-conditioned visual prediction model. We propose a hierarchical decomposition of semantic grasping and manipulation actions into elementary end-effector movements, to ensure compatibility with existing models and datasets for visual prediction of robotic actions such as RoboNet. We also record and release a new open dataset for visual prediction of object grasping, called PandaGrasp. Our model can be pre-trained on RoboNet and fine-tuned on PandaGrasp, and performs similarly to more complex models in terms of signal prediction metrics. Qualitatively, it outperforms them when predicting the outcome of complex grasps performed by our robot.
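
The abstract describes a hierarchical decomposition of semantic grasping actions into elementary end-effector movements, the per-frame action format used by RoboNet-style action-conditioned video prediction models. The sketch below illustrates one plausible form of such a decomposition; all names, dimensions, step sizes, and the gripper convention are illustrative assumptions, not the authors' actual implementation.

```python
# Hypothetical sketch: decompose a semantic "grasp object" action into
# elementary end-effector movements (per-step Cartesian displacement plus a
# gripper command), i.e. the kind of low-level action sequence that can
# condition a RoboNet-style video prediction model frame by frame.

from dataclasses import dataclass
from typing import List
import numpy as np


@dataclass
class ElementaryMove:
    """One low-level action step: end-effector displacement + gripper state."""
    dxyz: np.ndarray   # (3,) Cartesian displacement of the end effector [m]
    gripper: float     # 1.0 = close, 0.0 = open (illustrative convention)


def decompose_grasp(ee_pos: np.ndarray,
                    object_pos: np.ndarray,
                    lift_height: float = 0.10,
                    step_size: float = 0.02) -> List[ElementaryMove]:
    """Split a semantic grasp into elementary movements:
    reach above the object, descend, close the gripper, lift."""
    moves: List[ElementaryMove] = []

    def linear_segment(start: np.ndarray, goal: np.ndarray, gripper: float) -> None:
        # Split a straight-line motion into fixed-size displacement steps.
        delta = goal - start
        n_steps = max(1, int(np.ceil(np.linalg.norm(delta) / step_size)))
        for _ in range(n_steps):
            moves.append(ElementaryMove(dxyz=delta / n_steps, gripper=gripper))

    above = object_pos + np.array([0.0, 0.0, lift_height])
    linear_segment(ee_pos, above, gripper=0.0)        # reach above the object
    linear_segment(above, object_pos, gripper=0.0)    # descend to the object
    moves.append(ElementaryMove(np.zeros(3), 1.0))    # close the gripper
    linear_segment(object_pos, above, gripper=1.0)    # lift the object

    return moves


if __name__ == "__main__":
    plan = decompose_grasp(ee_pos=np.array([0.4, 0.0, 0.3]),
                           object_pos=np.array([0.5, 0.1, 0.05]))
    # Stack the moves into the (T, 4) per-frame action sequence that would
    # condition an action-conditioned visual prediction model.
    actions = np.stack([np.append(m.dxyz, m.gripper) for m in plan])
    print(actions.shape)
```

Because the decomposed actions share the same low-level format as existing datasets such as RoboNet, a model trained on those datasets can, in principle, be fine-tuned on new grasping data like PandaGrasp without changing its action interface.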
