Training Classifiers For Feedback Control
Abstract
One approach for feedback control using high dimensional and rich sensor measurements is to classify the measurement into one out of a finite set of situations, each situation corresponding to a (known) control action. This approach computes a control action without estimating the state. Such classifiers are typically learned from a finite amount of data using supervised machine learning algorithms. We model the closed-loop system resulting from control with feedback from classifier outputs as a piece-wise affine differential inclusion. We show how to train a linear classifier based on performance measures related to learning from data and the local stability properties of the resulting closed-loop system. The training method is based on the projected gradient descent algorithm. We demonstrate the advantage of training classifiers using control-theoretic properties on a case study involving navigation using range-based sensors.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2019
- arXiv:
- arXiv:1903.03688
- Bibcode:
- 2019arXiv190303688P
- Keywords:
-
- Mathematics - Optimization and Control
- E-Print:
- 9 pages, conference