Self-Organizing Maps as a Storage and Transfer Mechanism in Reinforcement Learning

doi:10.48550/arXiv.1807.07530

Reusing information from previously learned tasks (source tasks) when learning new tasks (target tasks) has the potential to significantly improve the sample efficiency of reinforcement learning agents. In this work, we describe an approach to concisely store and represent learned task knowledge, and to reuse it by allowing it to guide the exploration of an agent while it learns new tasks. To do so, we use a measure of similarity that is defined directly in the space of parameterized representations of the value functions. This similarity measure also serves as the basis for a variant of the growing self-organizing map algorithm, which is used to store previously acquired task knowledge in an adaptive and scalable manner. We empirically validate our approach in a simulated navigation environment and discuss possible extensions to this approach, along with potential applications where it could be particularly useful.
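The storage idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not name the similarity measure or the growth rule, so the cosine similarity between value-function weight vectors, the names `GrowingTaskMap`, `growth_threshold` and `learning_rate`, and the omission of any neighborhood update are all assumptions made for illustration.

```python
import numpy as np


def cosine_similarity(w_a, w_b):
    """Similarity between two value-function parameter vectors (assumed measure)."""
    denom = np.linalg.norm(w_a) * np.linalg.norm(w_b)
    if denom == 0.0:
        return 0.0
    return float(np.dot(w_a, w_b) / denom)


class GrowingTaskMap:
    """Hypothetical, simplified growing map of task weight vectors.

    Each node holds one parameter vector. A new vector either refines its
    most similar node or, if nothing is similar enough, becomes a new node.
    Neighborhood updates of a full self-organizing map are left out.
    """

    def __init__(self, growth_threshold=0.5, learning_rate=0.5):
        self.growth_threshold = growth_threshold  # assumed hyperparameter
        self.learning_rate = learning_rate        # assumed hyperparameter
        self.nodes = []

    def best_match(self, w):
        """Return (index, similarity) of the stored node most similar to w."""
        w = np.asarray(w, dtype=float)
        sims = [cosine_similarity(w, node) for node in self.nodes]
        idx = int(np.argmax(sims))
        return idx, sims[idx]

    def store(self, w):
        """Store a learned weight vector and return the index of its node."""
        w = np.asarray(w, dtype=float)
        if not self.nodes:
            self.nodes.append(w.copy())
            return 0
        idx, sim = self.best_match(w)
        if sim < self.growth_threshold:
            # No stored task is similar enough: grow the map.
            self.nodes.append(w.copy())
            return len(self.nodes) - 1
        # Otherwise move the best-matching node toward the new vector.
        self.nodes[idx] += self.learning_rate * (w - self.nodes[idx])
        return idx
```

In such a sketch, `best_match` applied to the current estimate of a target task's weights would pick the stored source task whose value function could guide exploration, while `store` keeps the map compact by merging similar tasks into one node.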

Publication: arXiv e-prints

Pub Date: July 2018

DOI: 10.48550/arXiv.1807.07530

arXiv: arXiv:1807.07530

Bibcode: 2018arXiv180707530K

Keywords: Computer Science - Machine Learning; Computer Science - Artificial Intelligence; Statistics - Machine Learning

E-Print: 7 pages, 7 figures, presented at ALA Workshop, FAIM, Stockholm, 2018
