MLDemon: Deployment Monitoring for Machine Learning Systems

doi:10.48550/arXiv.2104.13621

Post-deployment monitoring of ML systems is critical for ensuring reliability, especially as new user inputs can differ from the training distribution. Here we propose a novel approach, MLDemon, for ML DEployment MONitoring. MLDemon integrates both unlabeled data and a small amount of on-demand labels to produce a real-time estimate of the ML model's current performance on a given data stream. Subject to budget constraints, MLDemon decides when to acquire additional, potentially costly, expert supervised labels to verify the model. On temporal datasets with diverse distribution drifts and models, MLDemon outperforms existing approaches. Moreover, we provide theoretical analysis to show that MLDemon is minimax rate optimal for a broad class of distribution drifts.
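
The abstract describes the general loop of budgeted monitoring: watch the unlabeled stream, decide when to pay for an expert label, and keep a running performance estimate. The sketch below illustrates only that loop. It is not the MLDemon algorithm, whose query policy and guarantees are given in the paper. The class `BudgetedMonitor`, its parameters, and the scalar `drift_signal` (some anomaly score computed from unlabeled inputs) are all hypothetical names chosen for illustration.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class BudgetedMonitor:
    """Toy monitor for illustration only; NOT the MLDemon policy."""
    budget: int                    # total expert labels we may request (assumed)
    base_period: int = 10          # query every k-th point while the stream looks stable
    drift_period: int = 2          # query more often while drift is suspected
    drift_threshold: float = 0.5   # hypothetical cutoff on the unlabeled drift signal
    window: int = 50               # number of recent labeled outcomes to average
    outcomes: deque = field(default_factory=deque)
    since_query: int = 0
    spent: int = 0

    def step(self, drift_signal, prediction, get_label):
        """Process one stream point and return the accuracy estimate (None if no labels yet)."""
        self.since_query += 1
        suspicious = drift_signal > self.drift_threshold
        period = self.drift_period if suspicious else self.base_period
        if self.spent < self.budget and self.since_query >= period:
            label = get_label()            # costly expert query, made only on demand
            self.spent += 1
            self.since_query = 0
            self.outcomes.append(prediction == label)
            if len(self.outcomes) > self.window:
                self.outcomes.popleft()    # keep only the most recent outcomes
        if not self.outcomes:
            return None
        return sum(self.outcomes) / len(self.outcomes)


if __name__ == "__main__":
    monitor = BudgetedMonitor(budget=3, base_period=2, drift_period=1)
    # Each tuple: (drift_signal, model prediction, true label revealed only if queried)
    stream = [(0.0, 1, 1), (0.0, 1, 1), (0.9, 1, 0), (0.9, 0, 0), (0.9, 1, 1)]
    for signal, pred, truth in stream:
        estimate = monitor.step(signal, pred, lambda t=truth: t)
        print(estimate, monitor.spent)
```

The design choice shown here, a fixed query period that shortens when an unlabeled signal crosses a threshold, is one simple way to trade label cost against estimate freshness. Once the budget is exhausted the estimate can no longer be refreshed, which is the failure mode a principled policy has to manage.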

Publication:

arXiv e-prints

Pub Date:

April 2021

DOI:

10.48550/arXiv.2104.13621

arXiv:

arXiv:2104.13621

Bibcode:

2021arXiv210413621G

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

Accepted to AISTATS 2022. Significant changes to algorithm, theory, and experiments since previous versions
