MPC-based Imitation Learning for Safe and Human-like Autonomous Driving

doi:10.48550/arXiv.2206.12348

To ensure user acceptance of autonomous vehicles (AVs), control systems are being developed to mimic human drivers from demonstrations of desired driving behaviors. Imitation learning (IL) algorithms serve this purpose, but struggle to provide safety guarantees on the resulting closed-loop system trajectories. Model Predictive Control (MPC), on the other hand, can handle nonlinear systems with safety constraints, but realizing human-like driving with it requires extensive domain knowledge. This work proposes combining the two techniques to learn safe AV controllers from demonstrations of desired driving behaviors, using MPC as a differentiable control layer within a hierarchical IL policy. With this strategy, IL is performed in closed loop and end-to-end, through parameters in the MPC cost, model, or constraints. Experimental results are analyzed for the design of a lane keeping control system, learned via behavioral cloning from observations (BCO) from human demonstrations on a fixed-base driving simulator.
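As a rough illustration of what "MPC as a differentiable control layer" can mean, the sketch below learns a single MPC cost weight from state observations only. It is a toy under stated assumptions, not the method of the paper: the scalar lateral-error model (`A`, `B`), the weights `R` and `q`, the horizon, and all function names are hypothetical; constraints are omitted entirely; the "expert" is synthetic rather than a human driver; and a Gauss-Newton least-squares step with hand-written forward-mode derivatives stands in for whatever training procedure the authors use.

```python
class Dual:
    """Forward-mode AD number: a value and its derivative w.r.t. the learned weight."""

    def __init__(self, val, dot=0.0):
        self.val, self.dot = float(val), float(dot)

    @staticmethod
    def lift(x):
        return x if isinstance(x, Dual) else Dual(x)

    def __add__(self, o):
        o = Dual.lift(o)
        return Dual(self.val + o.val, self.dot + o.dot)

    __radd__ = __add__

    def __sub__(self, o):
        o = Dual.lift(o)
        return Dual(self.val - o.val, self.dot - o.dot)

    def __rsub__(self, o):
        return Dual.lift(o) - self

    def __mul__(self, o):
        o = Dual.lift(o)
        return Dual(self.val * o.val, self.dot * o.val + self.val * o.dot)

    __rmul__ = __mul__

    def __truediv__(self, o):
        o = Dual.lift(o)
        return Dual(self.val / o.val,
                    (self.dot * o.val - self.val * o.dot) / (o.val * o.val))


# Hypothetical scalar model of lateral error: x[k+1] = A*x[k] + B*u[k].
A, B, R, HORIZON = 1.0, 0.1, 1.0, 10


def mpc_gain(q):
    """First-step feedback gain of the unconstrained finite-horizon problem
    min sum_k (q*x_k^2 + R*u_k^2) + q*x_N^2, via a backward Riccati recursion.
    Works on floats and on Dual numbers, so the gain is differentiable in q."""
    p = q  # terminal cost weight
    for _ in range(HORIZON):
        k = (A * B * p) / (R + B * B * p)
        p = q + A * A * p - A * B * p * k
    return k


def rollout(q, x0=1.0, steps=30):
    """Closed-loop states under receding-horizon control u = -k*x.
    For this linear, unconstrained toy the first-step gain is the same at every step."""
    k = mpc_gain(q)
    x, xs = x0, []
    for _ in range(steps):
        x = (A - B * k) * x
        xs.append(x)
    return xs


def imitation_loss(q, expert_states):
    """Sum of squared state errors between the policy rollout and the demonstration."""
    return sum((x - x_ref) ** 2 for x, x_ref in zip(rollout(q), expert_states))


def fit_q(expert_states, q_init=1.0, iters=30):
    """Fit the cost weight q to observed states with Gauss-Newton steps,
    differentiating through the closed-loop rollout and the MPC solution."""
    q = q_init
    for _ in range(iters):
        xs = rollout(Dual(q, 1.0))  # seed dq/dq = 1
        jr = sum(x.dot * (x.val - x_ref) for x, x_ref in zip(xs, expert_states))
        jj = sum(x.dot * x.dot for x in xs)
        q = max(q - jr / jj, 1e-6)  # keep the weight positive
    return q


if __name__ == "__main__":
    expert = rollout(5.0)  # synthetic "expert" generated with q = 5
    print("recovered cost weight:", fit_q(expert))
```

Only states are compared in the loss, which loosely mirrors the observation-only setting of BCO; no expert control inputs are used. A constrained or nonlinear MPC layer would need a differentiable optimization solver in place of the Riccati recursion.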

Publication: arXiv e-prints
Pub Date: June 2022
DOI: 10.48550/arXiv.2206.12348
arXiv: arXiv:2206.12348
Bibcode: 2022arXiv220612348A
Keywords: Computer Science - Robotics; Electrical Engineering and Systems Science - Systems and Control
E-Print: Accepted at the 1st Workshop on Safe Learning for Autonomous Driving (SL4AD), co-located with the 39th International Conference on Machine Learning (ICML 2022)
