Posterior Dispersion Indices

doi:10.48550/arXiv.1605.07604

Probabilistic modeling is cyclical: we specify a model, infer its posterior, and evaluate its performance. Evaluation drives the cycle, as we revise our model based on how it performs. This requires a metric. Traditionally, predictive accuracy prevails. Yet, predictive accuracy does not tell the whole story. We propose to evaluate a model through posterior dispersion. The idea is to analyze how each datapoint fares in relation to posterior uncertainty around the hidden structure. We propose a family of posterior dispersion indices (PDI) that capture this idea. A PDI identifies rich patterns of model mismatch in three real data examples: voting preferences, supermarket shopping, and population genetics.

Publication: arXiv e-prints
Pub Date: May 2016
DOI: 10.48550/arXiv.1605.07604
arXiv: arXiv:1605.07604
Bibcode: 2016arXiv160507604K
Keywords: Statistics - Machine Learning; Computer Science - Artificial Intelligence; Statistics - Computation
