4.6 Article

Coactive design of explainable agent-based task planning and deep reinforcement learning for human-UAVs teamwork

期刊

CHINESE JOURNAL OF AERONAUTICS
卷 33, 期 11, 页码 2930-2945

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.cja.2020.05.001

关键词

Coactive design; Deep reinforcement learning; Human-robot teamwork; Mixed-initiative; Multi-agent system; Task planning; UAV

资金

  1. National Natural Science Foundation of China [61906203, 61876187]
  2. National Key Laboratory of Science and Technology on UAV, Northwestern Polytechnical University, China [614230110080817]

向作者/读者索取更多资源

Unmanned Aerial Vehicles (UAVs) are useful in dangerous and dynamic tasks such as search-and-rescue, forest surveillance, and anti-terrorist operations. These tasks can be solved better through the collaboration of multiple UAVs under human supervision. However, it is still difficult for human to monitor, understand, predict and control the behaviors of the UAVs due to the task complexity as well as the black-box machine learning and planning algorithms being used. In this paper, the coactive design method is adopted to analyze the cognitive capabilities required for the tasks and design the interdependencies among the heterogeneous teammates of UAVs or human for coherent collaboration. Then, an agent-based task planner is proposed to automatically decompose a complex task into a sequence of explainable subtasks under constrains of resources, execution time, social rules and costs. Besides, a deep reinforcement learning approach is designed for the UAVs to learn optimal policies of a flocking behavior and a path planner that are easy for the human operator to understand and control. Finally, a mixed-initiative action selection mechanism is used to evaluate the learned policies as well as the human's decisions. Experimental results demonstrate the effectiveness of the proposed methods. (c) 2020 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据