Massively Parallel Methods for Deep Reinforcement Learning

We present the first massively distributed architecture for deep reinforcement learning. This architecture uses four main components: parallel actors that generate new behaviour; parallel learners that are trained from stored experience; a distributed neural network to represent the value function or behaviour policy; and a distributed store of experience. We used our architecture to implement the Deep Q-Network algorithm (DQN). Our distributed algorithm was applied to 49 Atari 2600 games from the Arcade Learning Environment, using identical hyperparameters across all games. Its performance surpassed non-distributed DQN in 41 of the 49 games, and it also reduced the wall-clock time required to achieve these results by an order of magnitude on most games.
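The interaction of the four components can be illustrated with a small sketch. This is not the authors' implementation: it assumes a toy four-state chain environment, replaces the distributed neural network with a Q-table held by a hypothetical `ParameterServer`, and simulates parallelism with a sequential loop in one process. All class names, function names, and hyperparameter values here are illustrative assumptions.

```python
import random
from collections import deque

# Toy chain environment (an assumption, not Atari): action 0 = left, 1 = right.
N_STATES, GOAL, ACTIONS, GAMMA = 4, 3, (0, 1), 0.9

def env_step(state, action):
    """Deterministic chain; reaching GOAL pays reward 1 and ends the episode."""
    nxt = min(state + 1, GOAL) if action == 1 else max(state - 1, 0)
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done

class ParameterServer:
    """Holds the shared value function (a Q-table stands in for the network)."""
    def __init__(self, lr=0.5):
        self.q = [[0.0, 0.0] for _ in range(N_STATES)]
        self.lr, self.updates = lr, 0

    def pull(self):
        return [row[:] for row in self.q]  # copy, so callers hold stale params

    def push(self, steps):
        # Each entry is a negative gradient of 0.5 * (y - Q[s][a])**2.
        for (s, a), g in steps.items():
            self.q[s][a] += self.lr * g
        self.updates += 1

class Actor:
    """Generates behaviour with an epsilon-greedy policy on a local copy of Q."""
    def __init__(self, server, replay, rng, epsilon=0.5, sync_every=20):
        self.server, self.replay, self.rng = server, replay, rng
        self.epsilon, self.sync_every = epsilon, sync_every
        self.q, self.state, self.steps = server.pull(), 0, 0

    def act(self):
        if self.steps % self.sync_every == 0:
            self.q = self.server.pull()  # periodic parameter sync
        if self.rng.random() < self.epsilon:
            a = self.rng.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda x: self.q[self.state][x])
        nxt, r, done = env_step(self.state, a)
        self.replay.append((self.state, a, r, nxt, done))  # store experience
        self.state = 0 if done else nxt
        self.steps += 1

class Learner:
    """Samples stored experience and sends Q-learning updates to the server."""
    def __init__(self, server, replay, rng, batch=8, target_every=50):
        self.server, self.replay, self.rng = server, replay, rng
        self.batch, self.target_every = batch, target_every
        self.target, self.steps = server.pull(), 0

    def learn(self):
        if len(self.replay) < self.batch:
            return
        if self.steps % self.target_every == 0:
            self.target = self.server.pull()  # refresh the target table
        q, steps = self.server.pull(), {}
        for s, a, r, nxt, done in self.rng.sample(list(self.replay), self.batch):
            y = r if done else r + GAMMA * max(self.target[nxt])
            steps[(s, a)] = steps.get((s, a), 0.0) + (y - q[s][a]) / self.batch
        self.server.push(steps)
        self.steps += 1

def run(rounds=3000, n_actors=2, n_learners=2, seed=0):
    rng = random.Random(seed)
    server, replay = ParameterServer(), deque(maxlen=1000)  # bounded store
    actors = [Actor(server, replay, rng) for _ in range(n_actors)]
    learners = [Learner(server, replay, rng) for _ in range(n_learners)]
    for _ in range(rounds):  # sequential stand-in for parallel execution
        for actor in actors:
            actor.act()
        for learner in learners:
            learner.learn()
    return server, replay

server, replay = run()
# Greedy action in each non-terminal state under the learned Q-table.
greedy = [max(ACTIONS, key=lambda a: server.q[s][a]) for s in range(GOAL)]
print(greedy)
```

In this sketch the actors and learners never talk to each other directly: they are coupled only through the experience store and the parameter server, which is the property that lets each pool be scaled independently.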

Publication:

arXiv e-prints

Pub Date:

July 2015

DOI:

10.48550/arXiv.1507.04296

arXiv:

arXiv:1507.04296

Bibcode:

2015arXiv150704296N

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence;
Computer Science - Distributed, Parallel, and Cluster Computing;
Computer Science - Neural and Evolutionary Computing

E-Print:

Presented at the Deep Learning Workshop, International Conference on Machine Learning, Lille, France, 2015
