Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning
Abstract
A key challenge in solving the deterministic inverse reinforcement learning (IRL) problem online and in real-time is the existence of multiple solutions. Nonuniqueness necessitates the study of the notion of equivalent solutions, i.e., solutions that result in a different cost functional but same feedback matrix, and convergence to such solutions. While offline algorithms that result in convergence to equivalent solutions have been developed in the literature, online, real-time techniques that address nonuniqueness are not available. In this paper, a regularized history stack observer that converges to approximately equivalent solutions of the IRL problem is developed. Novel data-richness conditions are developed to facilitate the analysis and simulation results are provided to demonstrate the effectiveness of the developed technique.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2022
- DOI:
- 10.48550/arXiv.2210.16299
- arXiv:
- arXiv:2210.16299
- Bibcode:
- 2022arXiv221016299T
- Keywords:
-
- Electrical Engineering and Systems Science - Systems and Control;
- Computer Science - Machine Learning