Invariant Risk Minimization
Abstract
We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the causal structures governing the data and enable out-of-distribution generalization.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2019
- DOI:
- arXiv:
- arXiv:1907.02893
- Bibcode:
- 2019arXiv190702893A
- Keywords:
-
- Statistics - Machine Learning;
- Computer Science - Artificial Intelligence;
- Computer Science - Machine Learning