Meta-Reinforcement Learning for Heuristic Planning

doi:10.48550/arXiv.2107.02603

In Meta-Reinforcement Learning (meta-RL), an agent is trained on a set of tasks so that it is prepared for, and learns faster in, new, unseen, but related tasks. The training tasks are usually hand-crafted to be representative of the expected distribution of test tasks, and hence all of them are used in training. We show that, given a set of training tasks, learning can be both faster and more effective (leading to better performance in the test tasks) if the training tasks are appropriately selected. We propose a task selection algorithm, Information-Theoretic Task Selection (ITTS), based on information theory, which optimizes the set of tasks used for training in meta-RL, irrespective of how they are generated. The algorithm establishes which training tasks are both sufficiently relevant for the test tasks and different enough from one another. We reproduce different meta-RL experiments from the literature and show that ITTS improves the final performance in all of them.
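The selection idea in the abstract (keep training tasks that are relevant enough and different enough from one another) can be illustrated with a minimal sketch. This is not the ITTS algorithm as published: the abstract does not define the information-theoretic measures, so `difference`, `relevance`, both thresholds, the `target_tasks` stand-in for the test tasks, and the greedy loop itself are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence, TypeVar

Task = TypeVar("Task")


def select_tasks(
    candidates: Sequence[Task],
    target_tasks: Sequence[Task],
    difference: Callable[[Task, Task], float],  # hypothetical measure, not from the source
    relevance: Callable[[Task, Task], float],   # hypothetical measure, not from the source
    min_difference: float,
    min_relevance: float,
) -> List[Task]:
    """Greedily keep candidates that are novel and relevant (illustrative only)."""
    selected: List[Task] = []
    for task in candidates:
        # Skip a candidate that is too similar to a task already kept.
        if any(difference(task, kept) < min_difference for kept in selected):
            continue
        # Keep it only if it is relevant enough to at least one target task.
        if any(relevance(task, target) >= min_relevance for target in target_tasks):
            selected.append(task)
    return selected


# Toy usage: tasks are plain numbers, and both measures are simple distances.
chosen = select_tasks(
    candidates=[0.0, 0.1, 2.0, 9.0],
    target_tasks=[0.5, 2.5],
    difference=lambda a, b: abs(a - b),
    relevance=lambda a, b: -abs(a - b),
    min_difference=0.5,
    min_relevance=-1.0,
)
print(chosen)
```

In this toy run, 0.1 is dropped for being too close to 0.0 and 9.0 is dropped for being far from every target. In an actual meta-RL setting the two measures would be computed from policies or trajectories rather than from scalar task labels.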

Publication: arXiv e-prints

Pub Date: July 2021

DOI: 10.48550/arXiv.2107.02603

arXiv: arXiv:2107.02603

Bibcode: 2021arXiv210702603L

Keywords: Computer Science - Artificial Intelligence; Computer Science - Machine Learning

E-Print: ICAPS 2021

NASA/ADS
