Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning

doi:10.48550/arXiv.2406.00518

Orsula, Andrej

In the context of addressing the Robot Air Hockey Challenge 2023, we investigate the applicability of model-based deep reinforcement learning to acquire a policy capable of autonomously playing air hockey. Our agents learn solely from sparse rewards while incorporating self-play to iteratively refine their behaviour over time. The robotic manipulator is interfaced using continuous high-level actions for position-based control in the Cartesian plane while having partial observability of the environment with stochastic transitions. We demonstrate that agents are prone to overfitting when trained solely against a single playstyle, highlighting the importance of self-play for generalization to novel strategies of unseen opponents. Furthermore, the impact of the imagination horizon is explored in the competitive setting of the highly dynamic game of air hockey, with longer horizons resulting in more stable learning and better overall performance.

Publication:

arXiv e-prints

Pub Date:

June 2024

DOI:

10.48550/arXiv.2406.00518

arXiv:

arXiv:2406.00518

Bibcode:

2024arXiv240600518O

Keywords:

Computer Science - Robotics;
Computer Science - Artificial Intelligence;
Computer Science - Machine Learning

E-Print:

Robot Air Hockey Challenge 2023 | The source code is available at https://github.com/AndrejOrsula/drl_air_hockey
