4.5 Article Proceedings Paper

Planning for cars that coordinate with people: leveraging effects on human actions for planning and active information gathering over human internal state

期刊

AUTONOMOUS ROBOTS
卷 42, 期 7, 页码 1405-1426

出版社

SPRINGER
DOI: 10.1007/s10514-018-9746-1

关键词

Planning for human-robot interaction; Mathematical models of human behavior; Autonomous driving

资金

  1. Berkeley DeepDrive
  2. NSF VeHICaL [1545126]
  3. NSF [CCF-1139138, CCF-1116993]
  4. ONR [N00014-09-1-0230]
  5. NSF CAREER [1652083]
  6. NDSEG Fellowship
  7. Direct For Computer & Info Scie & Enginr [1652083] Funding Source: National Science Foundation
  8. Div Of Information & Intelligent Systems [1652083] Funding Source: National Science Foundation

向作者/读者索取更多资源

Traditionally, autonomous cars treat human-driven vehicles like moving obstacles. They predict their future trajectories and plan to stay out of their way. While physically safe, this results in defensive and opaque behaviors. In reality, an autonomous car's actions will actually affect what other cars will do in response, creating an opportunity for coordination. Our thesis is that we can leverage these responses to plan more efficient and communicative behaviors. We introduce a formulation of interaction with human-driven vehicles as an underactuated dynamical system, in which the robot's actions have consequences on the state of the autonomous car, but also on the human actions and thus the state of the human-driven car. We model these consequences by approximating the human's actions as (noisily) optimal with respect to some utility function. The robot uses the human actions as observations of her underlying utility function parameters. We first explore learning these parameters offline, and show that a robot planning in the resulting underactuated system is more efficient than when treating the person as a moving obstacle. We also show that the robot can target specific desired effects, like getting the person to switch lanes or to proceed first through an intersection. We then explore estimating these parameters online, and enable the robot to perform active information gathering: generating actions that purposefully probe the human in order to clarify their underlying utility parameters, like driving style or attention level. We show that this significantly outperforms passive estimation and improves efficiency. Planning in our model results in coordination behaviors: the robot inches forward at an intersection to see if can go through, or it reverses to make the other car proceed first. These behaviors result from the optimization, without relying on hand-coded signaling strategies. Our user studies support the utility of our model when interacting with real users.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据