☆ 4.6 Article

What Matters in Language Conditioned Robotic Imitation Learning Over Unstructured Data

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS

Volume 7, Issue 4, Pages 11205-11212

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/LRA.2022.3196123

Keywords

Imitation learning; learning categories and concepts; machine learning for robot control

Funding

German Federal Ministry of Education and Research [01IS18040B-OML]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper conducts an extensive study of the critical challenges in learning language conditioned policies from offline free-form imitation datasets. It identifies architectural and algorithmic techniques that improve performance and presents a novel approach that significantly outperforms the state of the art on challenging language conditioned long-horizon robot manipulation.

A long-standing goal in robotics is to build robots that can perform a wide range of daily tasks from perceptions obtained with their onboard sensors and specified only via natural language. While recently substantial advances have been achieved in language-driven robotics by leveraging end-to-end learning from pixels, there is no clear and well-understood process for making various design choices due to the underlying variation in setups. In this letter, we conduct an extensive study of the most critical challenges in learning language conditioned policies from offline free-form imitation datasets. We further identify architectural and algorithmic techniques that improve performance, such as a hierarchical decomposition of the robot control learning, a multimodal transformer encoder, discrete latent plans and a self-supervised contrastive loss that aligns video and language representations. By combining the results of our investigation with our improved model components, we are able to present a novel approach that significantly outperforms the state of the art on the challenging language conditioned long-horizon robot manipulation CALVIN benchmark. We have open-sourced our implementation to facilitate future research in learning to perform many complex manipulation skills in a row specified with natural language.

What Matters in Language Conditioned Robotic Imitation Learning Over Unstructured Data

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

What Matters in Language Conditioned Robotic Imitation Learning Over Unstructured Data

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper