Towards Theoretical Understanding of Data-Driven Policy Refinement

doi:10.48550/arXiv.2305.06796

Towards Theoretical Understanding of Data-Driven Policy Refinement

Baheri, Ali

This paper presents an approach for data-driven policy refinement in reinforcement learning, specifically designed for safety-critical applications. Our methodology leverages the strengths of data-driven optimization and reinforcement learning to enhance policy safety and optimality through iterative refinement. Our principal contribution lies in the mathematical formulation of this data-driven policy refinement concept. This framework systematically improves reinforcement learning policies by learning from counterexamples identified during data-driven verification. Furthermore, we present a series of theorems elucidating key theoretical properties of our approach, including convergence, robustness bounds, generalization error, and resilience to model mismatch. These results not only validate the effectiveness of our methodology but also contribute to a deeper understanding of its behavior in different environments and scenarios.

Publication:

arXiv e-prints

Pub Date:

May 2023

DOI:

10.48550/arXiv.2305.06796

arXiv:

arXiv:2305.06796

Bibcode:

2023arXiv230506796B

Keywords:

Computer Science - Machine Learning;
Electrical Engineering and Systems Science - Systems and Control

E-Print:

Accepted at the "Bridging the Gap Between AI Planning and Reinforcement Learning (PRL)" workshop at ICAPS 2023

ADS

Towards Theoretical Understanding of Data-Driven Policy Refinement

Abstract