Data-Driven Yet Formal Policy Synthesis for Stochastic Nonlinear Dynamical Systems
Abstract
The automated synthesis of control policies for stochastic dynamical systems presents significant challenges. A standard approach is to construct a finite-state abstraction of the continuous system, typically represented as a Markov decision process (MDP). However, generating abstractions is challenging when (1) the system's dynamics are nonlinear, and/or (2) we do not have complete knowledge of the dynamics. In this work, we introduce a novel data-driven abstraction technique for nonlinear dynamical systems with additive stochastic noise that addresses both of these issues. As a key step, we use samples of the dynamics to learn the enabled actions and transition probabilities of the abstraction. We represent abstractions as MDPs with intervals of transition probabilities, known as interval MDPs (IMDPs). These abstractions enable the synthesis of control policies for the concrete nonlinear system, with probably approximately correct (PAC) guarantees on the probability of satisfying a specified control objective. Through numerical experiments, we illustrate the effectiveness and robustness of our approach in achieving reliable control under uncertainty.
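As a rough illustration of the key step described in the abstract (learning transition-probability intervals from samples with PAC confidence), the sketch below estimates, for a single region-action pair, an interval around each empirical transition probability. This is a minimal sketch under assumptions not stated in the abstract: it assumes i.i.d. successor samples and uses a Hoeffding bound with a union bound over target regions; the function name `pac_transition_intervals` and its parameters are hypothetical, and the paper's actual interval construction may differ.

```python
import math
from collections import Counter

def pac_transition_intervals(successor_regions, num_regions, delta):
    """Estimate PAC intervals [p_lower, p_upper] on the transition
    probabilities of one (region, action) pair of the IMDP abstraction.

    successor_regions: list of abstract-region indices, one per sample
    num_regions: total number of abstract regions
    delta: confidence parameter; all intervals hold jointly with
           probability at least 1 - delta
    """
    n = len(successor_regions)
    counts = Counter(successor_regions)
    # Hoeffding half-width, with a union bound over the num_regions
    # intervals so they hold simultaneously with prob. >= 1 - delta.
    eps = math.sqrt(math.log(2 * num_regions / delta) / (2 * n))
    intervals = {}
    for region in range(num_regions):
        p_hat = counts.get(region, 0) / n
        intervals[region] = (max(0.0, p_hat - eps), min(1.0, p_hat + eps))
    return intervals

# Example: 1000 sampled successor states of one (region, action) pair,
# landing in three abstract regions.
samples = [0] * 620 + [1] * 350 + [2] * 30
print(pac_transition_intervals(samples, num_regions=3, delta=0.01))
```

With more samples the intervals tighten at rate O(1/sqrt(n)); a tighter per-region estimator (e.g., a Clopper-Pearson interval) could be swapped in, but the structure of the resulting IMDP is the same.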
- Publication:
- arXiv e-prints
- Pub Date:
- January 2025
- DOI:
- 10.48550/arXiv.2501.01191
- arXiv:
- arXiv:2501.01191
- Bibcode:
- 2025arXiv250101191N
- Keywords:
- Electrical Engineering and Systems Science - Systems and Control