☆ 3.8 Proceedings Paper

Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning

2022 IEEE 15TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2022) (2022)

期刊

2022 IEEE 15TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2022)

卷 -, 期 -, 页码 111-120

出版社

IEEE COMPUTER SOC

DOI: 10.1109/PacificVis53943.2022.00020

关键词

Human-centered computing; Visualization; Visualization techniques; Treemaps; Human-centered computing; Visualization; Visualization design and evaluation methods

类别

Computer Science, Artificial Intelligence Computer Science, Software Engineering Imaging Science & Photographic Technology

资金

U.S. National Science Foundation [OAC-1934766]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper introduces a visual analytics interface called PolicyExplainer, which allows users to directly query the reasoning behind the actions of a reinforcement learning agent. By visualizing the agent's states, policy, and rewards, PolicyExplainer provides explanations for the agent's decisions, promoting trust and understanding.

Reinforcement learning (RL) is used in many domains, including autonomous driving, robotics, stock trading, and video games. Unfortunately, the black box nature of RL agents, combined with legal and ethical considerations, makes it increasingly important that humans (including those are who not experts in RL) understand the reasoning behind the actions taken by an RL agent, particularly in safety-critical domains. To help address this challenge, we introduce PolicyExplainer, a visual analytics interface which lets the user directly query an autonomous agent. PolicyExplainer visualizes the states, policy, and expected future rewards for an agent, and supports asking and answering questions such as: Why take this action? Why not take this other action? When is this action taken? PolicyExplainer is designed based upon a domain analysis with RL researchers, and is evaluated via qualitative and quantitative assessments on a trio of domains: taxi navigation, a stack bot domain, and drug recommendation for HIV patients.We find that PolicyExplainer's visual approach promotes trust and understanding of agent decisions better than a state-of-the-art text-based explanation approach. Interviews with domain practitioners provide further validation for PolicyExplainer as applied to safety-critical domains. Our results help demonstrate how visualization-based approaches can be leveraged to decode the behavior of autonomous RL agents, particularly for RL non-experts.

Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning

期刊

2022 IEEE 15TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2022)

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning

期刊

2022 IEEE 15TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2022)

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文