Planning in Dynamic Environments with Conditional Autoregressive Models

doi:10.48550/arXiv.1811.10097

Planning in Dynamic Environments with Conditional Autoregressive Models

We demonstrate the use of conditional autoregressive generative models (van den Oord et al., 2016a) over a discrete latent space (van den Oord et al., 2017b) for forward planning with MCTS. In order to test this method, we introduce a new environment featuring varying difficulty levels, along with moving goals and obstacles. The combination of high-quality frame generation and classical planning approaches nearly matches true environment performance for our task, demonstrating the usefulness of this method for model-based planning in dynamic environments.

Publication:

arXiv e-prints

Pub Date:

November 2018

DOI:

10.48550/arXiv.1811.10097

arXiv:

arXiv:1811.10097

Bibcode:

2018arXiv181110097H

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence;
Computer Science - Robotics;
Statistics - Machine Learning

E-Print:

6 pages, 1 figure, in Proceedings of the Prediction and Generative Modeling in Reinforcement Learning Workshop at the International Conference on Machine Learning (ICML) in 2018

NASA/ADS

Planning in Dynamic Environments with Conditional Autoregressive Models

Abstract