Online learning with stability guarantees: A memory-based real-time model predictive controller
Abstract
We propose and analyze a real-time model predictive control (MPC) scheme that utilizes stored data to improve its performance by learning the value function online with stability guarantees. For linear and nonlinear systems, a learning method is presented that makes use of basic analytic properties of the cost function and is proven to learn the MPC control law and the value function on the limit set of the closed-loop state trajectory. The main idea is to generate a smart warm start based on historical data that improves future data points and thus future warm starts. We show that these warm starts are asymptotically exact and converge to the solution of the MPC optimization problem. Thereby, the suboptimality of the applied control input resulting from the real-time requirements vanishes over time. Simulative examples show that existing real-time MPC schemes can be improved by storing data and the proposed learning scheme.
- Publication:
-
arXiv e-prints
- Pub Date:
- December 2018
- DOI:
- 10.48550/arXiv.1812.09582
- arXiv:
- arXiv:1812.09582
- Bibcode:
- 2018arXiv181209582S
- Keywords:
-
- Mathematics - Optimization and Control
- E-Print:
- This article is an extended version of the paper "Online learning with stability guarantees: A memory-based warm starting for real-time MPC" published in Automatica, Volume 122, 109247, 2020, including all proofs, an application example, and a detailed description of the used algorithm