Indoor Path Planning for Multiple Unmanned Aerial Vehicles via Curriculum Learning

doi:10.48550/arXiv.2209.01776

This study applies multi-agent reinforcement learning to indoor path planning for two unmanned aerial vehicles (UAVs). Each UAV must move as fast as possible from an initial position to a goal position, with the two positions randomly paired, in an environment containing obstacles. To minimize training time and prevent damage to the UAVs, learning was performed in simulation. A multi-agent environment is non-stationary: the optimal behavior of one agent varies with the actions of the other agents. To account for this, the action of the other UAV was included in the state space of each UAV. Curriculum learning was performed in two stages to increase learning efficiency. The resulting goal rate was 89.0%, compared with 73.6% and 79.9% for the other learning strategies.
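The state augmentation and the two-stage curriculum described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the action set, the observation layout, or what distinguishes the two curriculum stages, so `N_ACTIONS`, `augment_state`, `curriculum_stage`, and `switch_episode` are all hypothetical names and values chosen for illustration.

```python
import numpy as np

N_ACTIONS = 5  # assumption: size of a discrete action set (not given in the abstract)


def one_hot(action, n_actions=N_ACTIONS):
    """Encode a discrete action index as a one-hot vector."""
    vec = np.zeros(n_actions, dtype=np.float32)
    vec[action] = 1.0
    return vec


def augment_state(own_obs, other_action, n_actions=N_ACTIONS):
    """Append the other UAV's action to this UAV's own observation.

    Including the other agent's action in the state is the idea named in
    the abstract; the one-hot encoding used here is an assumption.
    """
    own = np.asarray(own_obs, dtype=np.float32)
    return np.concatenate([own, one_hot(other_action, n_actions)])


def curriculum_stage(episode, switch_episode=1000):
    """Return 1 or 2 for a two-stage curriculum.

    Assumption: the stage switches at a fixed episode count. The abstract
    only states that there are two stages, not how they are scheduled.
    """
    return 1 if episode < switch_episode else 2


state = augment_state([0.1, 0.2, 0.3], other_action=2)
stage = curriculum_stage(episode=0)
```

Here `state` has the three original observation entries followed by five entries encoding the other UAV's action, so a policy conditioned on it can react to what the other agent did.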

Publication:

arXiv e-prints

Pub Date:

September 2022

DOI:

10.48550/arXiv.2209.01776

arXiv:

arXiv:2209.01776

Bibcode:

2022arXiv220901776P

Keywords:

Computer Science - Robotics

E-Print:

Accepted to ICTC 2022

NASA/ADS
