Pilot Performance modeling via observer-based inverse reinforcement learning

doi:10.48550/arXiv.2307.13150

Abstract:

This paper focuses on behavior modeling for pilots of unmanned aerial vehicles. The pilot is assumed to make decisions that optimize an unknown cost functional, which is estimated from observed trajectories using an inverse reinforcement learning (IRL) framework. The resulting IRL problem often admits multiple solutions, that is, distinct cost functionals that explain the same observed behavior. A recently developed IRL observer is adapted to the pilot modeling problem and is shown to converge to one of the equivalent solutions of the IRL problem. The technique is implemented on a quadcopter where the pilot is a linear quadratic supervisory controller that generates velocity commands for the quadcopter to travel to and hover over a desired location. Experimental results demonstrate the robustness of the method and its ability to learn equivalent cost functionals.
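The non-uniqueness mentioned in the abstract can be illustrated with a minimal sketch. It assumes a scalar linear system dx/dt = a*x + b*u with cost integrand q*x^2 + r*u^2; this is not the paper's quadcopter model or its observer, and the function name `scalar_lqr_gain` and all numeric values are hypothetical. Scaling q and r by the same positive constant leaves the optimal feedback gain unchanged, so an observer that only sees trajectories cannot tell such cost functionals apart.

```python
import math


def scalar_lqr_gain(a, b, q, r):
    """Optimal gain k (u = -k*x) for dx/dt = a*x + b*u with cost q*x^2 + r*u^2.

    Uses the stabilizing root of the scalar continuous-time Riccati equation
    2*a*p - (b**2 / r) * p**2 + q = 0.
    """
    if b == 0 or q <= 0 or r <= 0:
        raise ValueError("need b != 0, q > 0 and r > 0")
    p = r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
    return b * p / r


# Hypothetical plant parameters, chosen only for illustration.
a, b = 0.5, 2.0

k1 = scalar_lqr_gain(a, b, q=1.0, r=0.25)  # reference cost
k2 = scalar_lqr_gain(a, b, q=4.0, r=1.0)   # same cost scaled by 4
k3 = scalar_lqr_gain(a, b, q=1.0, r=1.0)   # different q/r ratio

print("scaled costs give the same gain:", math.isclose(k1, k2))
print("different q/r ratio gives the same gain:", math.isclose(k1, k3))
```

In this scalar case the gain depends only on the ratio q/r, so the equivalent solutions form a one-parameter family; in the multivariable setting the set of equivalent cost functionals can be larger.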

Publication:

arXiv e-prints

Pub Date:

July 2023

DOI:

10.48550/arXiv.2307.13150

arXiv:

arXiv:2307.13150

Bibcode:

2023arXiv230713150T

Keywords:

Electrical Engineering and Systems Science - Systems and Control;
Mathematics - Optimization and Control

E-Print:

arXiv admin note: text overlap with arXiv:2210.16299
