When Pearson $\chi^2$ and other divisible statistics are not goodness-of-fit tests
Abstract
Thousands of experiments are analyzed and papers are published each year involving the statistical analysis of grouped data. While this area of statistics is often perceived - somewhat naively - as saturated, several misconceptions still affect everyday practice, and new frontiers have so far remained unexplored. Researchers must be aware of the limitations affecting their analyses and what are the new possibilities in their hands. Motivated by this need, the article introduces a unifying approach to the analysis of grouped data which allows us to study the class of divisible statistics - that includes Pearson's $\chi^2$, the likelihood ratio as special cases - with a fresh perspective. The contributions collected in this manuscript span from modeling and estimation to distribution-free goodness-of-fit tests. Perhaps the most surprising result presented here is that, in a sparse regime, all tests proposed in the literature are dominated by a class of weighted linear statistics.
- Publication:
-
arXiv e-prints
- Pub Date:
- June 2024
- DOI:
- 10.48550/arXiv.2406.09195
- arXiv:
- arXiv:2406.09195
- Bibcode:
- 2024arXiv240609195A
- Keywords:
-
- Statistics - Methodology;
- Mathematics - Statistics Theory;
- Physics - Data Analysis;
- Statistics and Probability;
- Statistics - Computation