4.6 Article

A neural integrator model for planning and value-based decision making of a robotics assistant

期刊

NEURAL COMPUTING & APPLICATIONS
卷 33, 期 8, 页码 3737-3756

出版社

SPRINGER LONDON LTD
DOI: 10.1007/s00521-020-05224-8

关键词

Dynamic Neural Field; Neural integrator; Assembly robot; Value-based decision making; Sequence learning

资金

  1. FCT [PD/BD/128183/2016, SFRH/BD/124912/2016, PTDC/MAT-APL/31393/2017]
  2. research centre CMAT [UID/MAT/00013/2020]
  3. Fundação para a Ciência e a Tecnologia [SFRH/BD/124912/2016, PTDC/MAT-APL/31393/2017, PD/BD/128183/2016] Funding Source: FCT

向作者/读者索取更多资源

Modern manufacturing and assembly environments are characterized by high variability in the built process, challenging human-robot cooperation. Research shows that robots can learn, plan, and make decisions through brain-like computations based on Dynamic Neural Fields, reducing the cognitive workload of operators and improving efficiency.
Modern manufacturing and assembly environments are characterized by a high variability in the built process which challenges human-robot cooperation. To reduce the cognitive workload of the operator, the robot should not only be able to learn from experience but also to plan and decide autonomously. Here, we present an approach based on Dynamic Neural Fields that apply brain-like computations to endow a robot with these cognitive functions. A neural integrator is used to model the gradual accumulation of sensory and other evidence as time-varying persistent activity of neural populations. The decision to act is modeled by a competitive dynamics between neural populations linked to different motor behaviors. They receive the persistent activation pattern of the integrators as input. In the first experiment, a robot learns rapidly by observation the sequential order of object transfers between an assistant and an operator to subsequently substitute the assistant in the joint task. The results show that the robot is able to proactively plan the series of handovers in the correct order. In the second experiment, a mobile robot searches at two different workbenches for a specific object to deliver it to an operator. The object may appear at the two locations in a certain time period with independent probabilities unknown to the robot. The trial-by-trial decision under uncertainty is biased by the accumulated evidence of past successes and choices. The choice behavior over a longer period reveals that the robot achieves a high search efficiency in stationary as well as dynamic environments.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据