Reinforcement Learning Generalization with Surprise Minimization

doi:10.48550/arXiv.2004.12399

Reinforcement Learning Generalization with Surprise Minimization

Zikun Chen, Jerry

Generalization remains a challenging problem for deep reinforcement learning algorithms, which are often trained and tested on the same set of deterministic game environments. When test environments are unseen and perturbed but the nature of the task remains the same, generalization gaps can arise. In this work, we propose and evaluate a surprise minimizing agent on a generalization benchmark to show an additional reward learned from a simple density model can show robustness in procedurally generated game environments that provide constant source of entropy and stochasticity.

Publication:

arXiv e-prints

Pub Date:

April 2020

DOI:

10.48550/arXiv.2004.12399

arXiv:

arXiv:2004.12399

Bibcode:

2020arXiv200412399Z

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence

E-Print:

Inductive biases, invariances and generalization in RL Workshop, ICML 2020

NASA/ADS

Reinforcement Learning Generalization with Surprise Minimization

Abstract