Complementary reinforcement learning towards explainable agents

Lee, Jung Hoon

arXiv:1901.00188 (cs)

[Submitted on 1 Jan 2019 (v1), last revised 24 Jan 2019 (this version, v2)]

Abstract: Reinforcement learning (RL) algorithms allow agents to learn skills and strategies for performing complex tasks without detailed instructions or expensive labelled training examples. That is, RL agents can learn as we learn. Given the importance of learning to our intelligence, RL has been thought to be one of the key components of general artificial intelligence, and recent breakthroughs in deep reinforcement learning suggest that neural networks (NN) are natural platforms for RL agents. However, despite the efficiency and versatility of NN-based RL agents, their decision-making remains incomprehensible, which reduces their utility. To deploy RL in a wider range of applications, it is imperative to develop explainable NN-based RL agents. Here, we propose a method to derive, from a NN-based RL agent, a secondary comprehensible agent whose decision-making is based on simple rules. Our empirical evaluation of this secondary agent's performance supports the possibility of building a comprehensible and transparent agent using a NN-based RL agent.

Comments:	14 pages, 5 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1901.00188 [cs.LG]
	(or arXiv:1901.00188v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1901.00188

Submission history

From: Jung Lee
[v1] Tue, 1 Jan 2019 18:14:35 UTC (578 KB)
[v2] Thu, 24 Jan 2019 02:29:24 UTC (1,269 KB)
