Canonical Estimation in a Rare-Events Regime

doi:10.48550/arXiv.1109.4564

Canonical Estimation in a Rare-Events Regime

We propose a general methodology for performing statistical inference within a `rare-events regime' that was recently suggested by Wagner, Viswanath and Kulkarni. Our approach allows one to easily establish consistent estimators for a very large class of canonical estimation problems, in a large alphabet setting. These include the problems studied in the original paper, such as entropy and probability estimation, in addition to many other interesting ones. We particularly illustrate this approach by consistently estimating the size of the alphabet and the range of the probabilities. We start by proposing an abstract methodology based on constructing a probability measure with the desired asymptotic properties. We then demonstrate two concrete constructions by casting the Good-Turing estimator as a pseudo-empirical measure, and by using the theory of mixture model estimation.

Publication:

arXiv e-prints

Pub Date:

September 2011

DOI:

10.48550/arXiv.1109.4564

arXiv:

arXiv:1109.4564

Bibcode:

2011arXiv1109.4564O

Keywords:

Computer Science - Information Theory;
Mathematics - Statistics Theory

E-Print:

To be presented at Allerton conference 2011

NASA/ADS

Canonical Estimation in a Rare-Events Regime

Abstract