☆ 4.7 Article

Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations

ARTIFICIAL INTELLIGENCE (2020)

期刊

ARTIFICIAL INTELLIGENCE

卷 288, 期 -, 页码 -

出版社

ELSEVIER

DOI: 10.1016/j.artint.2020.103367

关键词

Explainable AI; Reinforcement learning; Interestingness elements; Autonomy; Video highlights; Visual explanations

类别

Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

We propose an explainable reinforcement learning (XRL) framework that analyzes an agent's history of interaction with the environment to extract interestingness elements that help explain its behavior. The framework relies on data readily available from standard RL algorithms, augmented with data that can easily be collected by the agent while learning. We describe how to create visual summaries of an agent's behavior in the form of short video-clips highlighting key interaction moments, based on the proposed elements. We also report on a user study where we evaluated the ability of humans to correctly perceive the aptitude of agents with different characteristics, including their capabilities and limitations, given visual summaries automatically generated by our framework. The results show that the diversity of aspects captured by the different interestingness elements is crucial to help humans correctly understand an agent's strengths and limitations in performing a task, and determine when it might need adjustments to improve its performance. (C) 2020 Elsevier B.V. All rights reserved.

Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations

期刊

ARTIFICIAL INTELLIGENCE

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations

期刊

ARTIFICIAL INTELLIGENCE

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文