Policy Optimization with Sparse Global Contrastive Explanations

doi:10.48550/arXiv.2207.06269

Policy Optimization with Sparse Global Contrastive Explanations

We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of keeping that global contrastive explanation short. We demonstrate our framework with a discrete MDP and a continuous 2D navigation domain.

Publication:

arXiv e-prints

Pub Date:

July 2022

DOI:

10.48550/arXiv.2207.06269

arXiv:

arXiv:2207.06269

Bibcode:

2022arXiv220706269Y

Keywords:

Computer Science - Machine Learning

E-Print:

Accepted at IMLH Workshop, ICML 2022

NASA/ADS

Policy Optimization with Sparse Global Contrastive Explanations

Abstract