On the Effective Horizon of Inverse Reinforcement Learning

doi:10.48550/arXiv.2307.06541

Inverse reinforcement learning (IRL) algorithms often rely on (forward) reinforcement learning or planning over a given time horizon to compute an approximately optimal policy for a hypothesized reward function and then match this policy with expert demonstrations. The time horizon plays a critical role in determining both the accuracy of the reward estimate and the computational efficiency of IRL algorithms. Interestingly, an effective time horizon shorter than the ground-truth value often produces better results faster. This work formally analyzes this phenomenon and provides an explanation: the time horizon controls the complexity of an induced policy class and mitigates overfitting with limited data. This analysis leads to a principled choice of the effective horizon for IRL. It also prompts us to reexamine the classic IRL formulation: it is more natural to jointly learn the reward and the effective horizon than to learn the reward alone with a given horizon. Our experimental results confirm the theoretical analysis.
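As a rough illustration of the planning step the abstract refers to, the sketch below runs finite-horizon value iteration on a toy four-state chain and shows that the greedy policy induced by a fixed reward changes with the planning horizon. It is not the paper's algorithm: the function `greedy_policy`, the transition table `P`, and the reward vector `R` are hypothetical names and values chosen only for this example.

```python
def greedy_policy(P, R, horizon):
    """Finite-horizon value iteration (illustrative sketch).

    P[s][a] is a list of (probability, next_state) pairs.
    R[s] is the reward collected on entering state s (an assumption
    of this toy setup). Returns the greedy first-step action in each
    state when `horizon` steps remain.
    """
    n_states = len(P)
    V = [0.0] * n_states          # value with zero steps to go
    policy = [0] * n_states
    for _ in range(horizon):
        new_V = []
        for s in range(n_states):
            # One-step lookahead: reward on arrival plus value-to-go.
            q = [sum(p * (R[s2] + V[s2]) for p, s2 in P[s][a])
                 for a in range(len(P[s]))]
            best = max(range(len(q)), key=q.__getitem__)
            policy[s] = best
            new_V.append(q[best])
        V = new_V
    return policy


# Hypothetical chain: action 0 moves left, action 1 moves right,
# with the walls at states 0 and 3 keeping the agent in place.
P = [[[(1.0, max(s - 1, 0))], [(1.0, min(s + 1, 3))]] for s in range(4)]
# Small reward at the left end, large reward at the right end.
R = [1.0, 0.0, 0.0, 10.0]

for h in (1, 2, 3):
    print(h, greedy_policy(P, R, h))
# 1 [0, 0, 1, 1]
# 2 [0, 1, 1, 1]
# 3 [1, 1, 1, 1]
```

With a one-step horizon the two leftmost states prefer the nearby small reward; lengthening the horizon flips them toward the distant large reward. In this toy case, then, the horizon alone determines which policy a given reward induces, which is the dependence the abstract describes.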

Publication:

arXiv e-prints

Pub Date:

July 2023

DOI:

10.48550/arXiv.2307.06541

arXiv:

arXiv:2307.06541

Bibcode:

2023arXiv230706541X

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence

E-Print:

9 pages, under review
