☆ 4.7 Article

Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning

JOURNAL OF MACHINE LEARNING RESEARCH (2022)

期刊

JOURNAL OF MACHINE LEARNING RESEARCH

卷 23, 期 -, 页码 -

出版社

MICROTOME PUBL

关键词

Developmental learning; developmental AI; open-ended learning; intrinsic motivations; autotelic agents; population-based IMGEP; goal exploration; curiosity-driven learning; modularity; robotics; automatic curriculum learning

类别

Automation & Control Systems Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Intrinsically motivated spontaneous exploration is crucial for autonomous developmental learning in human children, and the Intrinsically Motivated Goal Exploration Processes (IMGEP) algorithmic approach enables similar autonomous learning properties in machines. The IMGEP architecture relies on principles such as self-generation of goals, goal selection based on intrinsic rewards, and systematic reuse of acquired information. The AMB, a highly efficient form of IMGEP, can automatically generate a learning curriculum and has been demonstrated in various experimental setups.

Intrinsically motivated spontaneous exploration is a key enabler of autonomous developmental learning in human children. It enables the discovery of skill repertoires through autotelic learning, i.e. the self-generation, self-selection, self-ordering and self-experimentation of learning goals. We present an algorithmic approach called Intrinsically Motivated Goal Exploration Processes (IMGEP) to enable similar properties of autonomous learning in machines. The IMGEP architecture relies on several principles: 1) self-generation of goals, generalized as parameterized fitness functions; 2) selection of goals based on intrinsic rewards; 3) exploration with incremental goal-parameterized policy search and exploitation with a batch learning algorithm; 4) systematic reuse of information acquired when targeting a goal for improving towards other goals. We present a particularly efficient form of IMGEP, called AMB, that uses a population-based policy and an object-centered spatio-temp oral modularity. We provide several implementations of this architecture and demonstrate their ability to automatically generate a learning curriculum within several experimental setups. One of these experiments includes a real humanoid robot exploring multiple spaces of goals with several hundred continuous dimensions and with distractors. While no particular target goal is provided to these autotelic agents, this curriculum allows the discovery of diverse skills that act as stepping stones for learning more complex skills, e.g. nested tool use.

Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning

期刊

JOURNAL OF MACHINE LEARNING RESEARCH

出版社

MICROTOME PUBL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning

期刊

JOURNAL OF MACHINE LEARNING RESEARCH

出版社

MICROTOME PUBL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文