Reinforcement Learning with Time-dependent Goals for Robotic Musicians

doi:10.48550/arXiv.2011.05715

Reinforcement Learning with Time-dependent Goals for Robotic Musicians

Reinforcement learning is a promising method to accomplish robotic control tasks. The task of playing musical instruments is, however, largely unexplored because it involves the challenge of achieving sequential goals - melodies - that have a temporal dimension. In this paper, we address robotic musicianship by introducing a temporal extension to goal-conditioned reinforcement learning: Time-dependent goals. We demonstrate that these can be used to train a robotic musician to play the theremin instrument. We train the robotic agent in simulation and transfer the acquired policy to a real-world robotic thereminist. Supplemental video: https://youtu.be/jvC9mPzdQN4

Publication:

arXiv e-prints

Pub Date:

November 2020

DOI:

10.48550/arXiv.2011.05715

arXiv:

arXiv:2011.05715

Bibcode:

2020arXiv201105715F

Keywords:

Computer Science - Robotics;
Computer Science - Artificial Intelligence

E-Print:

Preprint, submitted to IEEE Robotics and Automation Letters (RA-L) 2021 with International Conference on Robotics and Automation Conference Option (ICRA) 2021

NASA/ADS

Reinforcement Learning with Time-dependent Goals for Robotic Musicians

Abstract