Exploration for Multi-task Reinforcement Learning with Deep Generative Models

doi:10.48550/arXiv.1611.09894

Exploration for Multi-task Reinforcement Learning with Deep Generative Models

Exploration in multi-task reinforcement learning is critical in training agents to deduce the underlying MDP. Many of the existing exploration frameworks such as $E^3$, $R_{max}$, Thompson sampling assume a single stationary MDP and are not suitable for system identification in the multi-task setting. We present a novel method to facilitate exploration in multi-task reinforcement learning using deep generative models. We supplement our method with a low dimensional energy model to learn the underlying MDP distribution and provide a resilient and adaptive exploration signal to the agent. We evaluate our method on a new set of environments and provide intuitive interpretation of our results.

Publication:

arXiv e-prints

Pub Date:

November 2016

DOI:

10.48550/arXiv.1611.09894

arXiv:

arXiv:1611.09894

Bibcode:

2016arXiv161109894P

Keywords:

Computer Science - Artificial Intelligence;
Computer Science - Machine Learning;
Statistics - Machine Learning;
I.2;
I.5

E-Print:

9 pages, 5 figures

NASA/ADS

Exploration for Multi-task Reinforcement Learning with Deep Generative Models

Abstract