Policy Optimization with Sparse Global Contrastive Explanations
Abstract
We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of keeping that global contrastive explanation short. We demonstrate our framework with a discrete MDP and a continuous 2D navigation domain.
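As a rough illustration of the idea of improving a behavior policy while changing it in only a few places, the sketch below works on a small tabular MDP. It is not the paper's algorithm: it is a simple greedy heuristic that assumes a known transition tensor `P` and reward matrix `R`, and it uses the number of states whose action changes as a crude stand-in for the length of a global contrastive explanation. The function names, the chain MDP, and the `max_changes` parameter are all hypothetical.

```python
import numpy as np

def evaluate_policy(P, R, policy, gamma):
    """Exact state values of a deterministic policy.
    P has shape (S, A, S), R has shape (S, A)."""
    n_states = P.shape[0]
    idx = np.arange(n_states)
    P_pi = P[idx, policy]  # (S, S) transitions under the policy
    R_pi = R[idx, policy]  # (S,) rewards under the policy
    return np.linalg.solve(np.eye(n_states) - gamma * P_pi, R_pi)

def sparse_improve(P, R, behavior_policy, gamma=0.9, max_changes=2):
    """Greedy heuristic (an assumption, not the authors' method):
    edit the action of at most `max_changes` states, each time picking
    the single edit with the largest advantage under the current policy."""
    policy = behavior_policy.copy()
    locked = np.zeros(P.shape[0], dtype=bool)  # states already edited
    for _ in range(max_changes):
        V = evaluate_policy(P, R, policy, gamma)
        Q = R + gamma * (P @ V)           # (S, A) action values
        advantage = Q - V[:, None]
        advantage[locked] = -np.inf       # each state is edited at most once
        s, a = np.unravel_index(np.argmax(advantage), advantage.shape)
        if advantage[s, a] <= 1e-12:      # no remaining edit helps
            break
        policy[s] = a
        locked[s] = True
    return policy, np.flatnonzero(locked)

# Hypothetical 4-state chain: action 0 moves left, action 1 moves right,
# and a reward of 1 is given whenever the next state is the last one.
S, A, gamma = 4, 2, 0.9
P = np.zeros((S, A, S))
R = np.zeros((S, A))
for s in range(S):
    for a, step in enumerate((-1, 1)):
        nxt = min(max(s + step, 0), S - 1)
        P[s, a, nxt] = 1.0
        R[s, a] = 1.0 if nxt == S - 1 else 0.0

behavior = np.zeros(S, dtype=int)  # behavior policy: always move left
new_policy, changed_states = sparse_improve(P, R, behavior, gamma, max_changes=2)
V_old = evaluate_policy(P, R, behavior, gamma)
V_new = evaluate_policy(P, R, new_policy, gamma)
print("changed states:", changed_states)
print("value gain per state:", V_new - V_old)
```

Each single-state edit with positive advantage cannot lower the value of any state, so the edited policy is never worse than the behavior policy, and the list of changed states doubles as a short description of how the two policies differ.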
- Publication: arXiv e-prints
- Pub Date: July 2022
- DOI: 10.48550/arXiv.2207.06269
- arXiv: arXiv:2207.06269
- Bibcode: 2022arXiv220706269Y
- Keywords: Computer Science - Machine Learning
- E-Print: Accepted at IMLH Workshop, ICML 2022