Trajectory Planning With Deep Reinforcement Learning in High-Level Action Spaces

doi:10.1109/TAES.2022.3218496

Trajectory Planning With Deep Reinforcement Learning in High-Level Action Spaces

This paper presents a technique for trajectory planning based on continuously parameterized high-level actions (motion primitives) of variable duration. This technique leverages deep reinforcement learning (Deep RL) to formulate a policy which is suitable for real-time implementation. There is no separation of motion primitive generation and trajectory planning: each individual short-horizon motion is formed during the Deep RL training to achieve the full-horizon objective. Effectiveness of the technique is demonstrated numerically on a well-studied trajectory generation problem and a planning problem on a known obstacle-rich map. This paper also develops a new loss function term for policy-gradient-based Deep RL, which is analogous to an anti-windup mechanism in feedback control. We demonstrate the inclusion of this new term in the underlying optimization increases the average policy return in our numerical example.

Publication:

IEEE Transactions on Aerospace Electronic Systems

Pub Date:

June 2023

DOI:

10.1109/TAES.2022.3218496

arXiv:

arXiv:2110.00044

Bibcode:

2023ITAES..59.2513W

Keywords:

Electrical Engineering and Systems Science - Systems and Control

E-Print:

IEEE Transactions on Aerospace and Electronic Systems, 59 (2023) 2513-2529

NASA/ADS

Trajectory Planning With Deep Reinforcement Learning in High-Level Action Spaces

Abstract