(Implicit) Ensembles of Ensembles: Epistemic Uncertainty Collapse in Large Models

doi:10.48550/arXiv.2409.02628

(Implicit) Ensembles of Ensembles: Epistemic Uncertainty Collapse in Large Models

Kirsch, Andreas

Epistemic uncertainty is crucial for safety-critical applications and out-of-distribution detection tasks. Yet, we uncover a paradoxical phenomenon in deep learning models: an epistemic uncertainty collapse as model complexity increases, challenging the assumption that larger models invariably offer better uncertainty quantification. We propose that this stems from implicit ensembling within large models. To support this hypothesis, we demonstrate epistemic uncertainty collapse empirically across various architectures, from explicit ensembles of ensembles and simple MLPs to state-of-the-art vision models, including ResNets and Vision Transformers -- for the latter, we examine implicit ensemble extraction and decompose larger models into diverse sub-models, recovering epistemic uncertainty. We provide theoretical justification for these phenomena and explore their implications for uncertainty estimation.

Publication:

arXiv e-prints

Pub Date:

September 2024

DOI:

10.48550/arXiv.2409.02628

arXiv:

arXiv:2409.02628

Bibcode:

2024arXiv240902628K

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

10 pages

NASA/ADS

(Implicit) Ensembles of Ensembles: Epistemic Uncertainty Collapse in Large Models

Abstract