Computational Implications of Reducing Data to Sufficient Statistics
Abstract
Given a large dataset and an estimation task, it is common to pre-process the data by reducing them to a set of sufficient statistics. This step is often regarded as straightforward and advantageous (in that it simplifies statistical analysis). I show that -on the contrary- reducing data to sufficient statistics can change a computationally tractable estimation problem into an intractable one. I discuss connections with recent work in theoretical computer science, and implications for some techniques to estimate graphical models.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2014
- DOI:
- 10.48550/arXiv.1409.3821
- arXiv:
- arXiv:1409.3821
- Bibcode:
- 2014arXiv1409.3821M
- Keywords:
-
- Statistics - Computation;
- Computer Science - Information Theory;
- Computer Science - Machine Learning
- E-Print:
- 20 pages