A General Framework for Learning-Based Distributionally Robust MPC of Markov Jump Systems

doi:10.48550/arXiv.2106.00561

A General Framework for Learning-Based Distributionally Robust MPC of Markov Jump Systems

We present a learning model predictive control (MPC) scheme for chance-constrained Markov jump systems with unknown switching probabilities. Using samples of the underlying Markov chain, ambiguity sets of transition probabilities are estimated which include the true conditional probability distributions with high probability. These sets are updated online and used to formulate a time-varying, risk-averse optimal control problem. We prove recursive feasibility of the resulting MPC scheme and show that the original chance constraints remain satisfied at every time step. Furthermore, we show that under sufficient decrease of the confidence levels, the resulting MPC scheme renders the closed-loop system mean-square stable with respect to the true-but-unknown distributions, while remaining less conservative than a fully robust approach. Finally, we show that the value function of the learning MPC converges from above to its nominal counterpart as the sample size grows to infinity. We illustrate our approach on a numerical example.

Publication:

arXiv e-prints

Pub Date:

June 2021

DOI:

10.48550/arXiv.2106.00561

arXiv:

arXiv:2106.00561

Bibcode:

2021arXiv210600561S

Keywords:

Mathematics - Optimization and Control;
Electrical Engineering and Systems Science - Systems and Control

E-Print:

Revised version

NASA/ADS

A General Framework for Learning-Based Distributionally Robust MPC of Markov Jump Systems

Abstract