Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms

doi:10.48550/arXiv.2404.10645

Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms

Zhou, Zehao

Distributed Distributional DrQ is a model-free and off-policy RL algorithm for continuous control tasks based on the state and observation of the agent, which is an actor-critic method with the data-augmentation and the distributional perspective of critic value function. Aim to learn to control the agent and master some tasks in a high-dimensional continuous space. DrQ-v2 uses DDPG as the backbone and achieves out-performance in various continuous control tasks. Here Distributed Distributional DrQ uses Distributed Distributional DDPG as the backbone, and this modification aims to achieve better performance in some hard continuous control tasks through the better expression ability of distributional value function and distributed actor policies.

Publication:

arXiv e-prints

Pub Date:

April 2024

DOI:

10.48550/arXiv.2404.10645

arXiv:

arXiv:2404.10645

Bibcode:

2024arXiv240410645Z

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence;
Computer Science - Robotics

E-Print:

11 pages, 12 figures

NASA/ADS

Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms

Abstract