Hybrid Summary Statistics
Abstract
We present a way to capture high-information posteriors from training sets that are sparsely sampled over the parameter space for robust simulation-based inference. In physical inference problems, we can often apply domain knowledge to define traditional summary statistics to capture some of the information in a dataset. We show that augmenting these statistics with neural network outputs to maximise the mutual information improves information extraction compared to neural summaries alone or their concatenation to existing summaries and makes inference robust in settings with low training data. We introduce 1) two loss formalisms to achieve this and 2) apply the technique to two different cosmological datasets to extract non-Gaussian parameter information.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2024
- DOI:
- 10.48550/arXiv.2410.07548
- arXiv:
- arXiv:2410.07548
- Bibcode:
- 2024arXiv241007548M
- Keywords:
-
- Statistics - Machine Learning;
- Astrophysics - Cosmology and Nongalactic Astrophysics;
- Computer Science - Information Theory;
- Computer Science - Machine Learning;
- Physics - Data Analysis;
- Statistics and Probability
- E-Print:
- 7 pages, 4 figures. Accepted to ML4PS2024 at NeurIPS 2024