Characterizing predictable classes of processes
Abstract
The problem is sequence prediction in the following setting. A sequence x1,..., xn,... of discrete-valued observations is generated according to some unknown probabilistic law (measure) mu. After observing each outcome, it is required to give the conditional probabilities of the next observation. The measure mu belongs to an arbitrary class C of stochastic processes. We are interested in predictors rho whose conditional probabilities converge to the 'true' mu-conditional probabilities if any mu in C is chosen to generate the data. We show that if such a predictor exists, then a predictor can also be obtained as a convex combination of countably many elements of C. In other words, it can be obtained as a Bayesian predictor whose prior is concentrated on a countable set. This result is established for two very different measures of prediction performance: one very strong, namely total variation, and the other very weak, namely expected average Kullback-Leibler divergence.
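To make the notion of a Bayesian predictor with a prior on a countable set concrete, here is a minimal sketch in Python. It is an illustration under strong assumptions and not the construction from the paper: the class C is taken to be i.i.d. Bernoulli processes, the countable set is truncated to a finite grid of parameters, and the names `make_mixture_predictor`, `params`, and `weights` are hypothetical.

```python
def make_mixture_predictor(params, weights):
    """Bayesian mixture over i.i.d. Bernoulli measures (illustrative assumption).

    params  -- success probabilities p_k, each strictly between 0 and 1
    weights -- prior weights w_k, positive and summing to 1
    Returns (predict, update): predict() gives the mixture's conditional
    probability that the next symbol is 1; update(x) conditions on x.
    """
    posterior = list(weights)

    def predict():
        # Mixture conditional probability: posterior-weighted average of p_k.
        return sum(w * p for w, p in zip(posterior, params))

    def update(x):
        # Bayes rule: reweight each component by the likelihood of x,
        # then renormalize so the weights sum to 1 again.
        for k, p in enumerate(params):
            posterior[k] *= p if x == 1 else 1.0 - p
        total = sum(posterior)
        for k in range(len(posterior)):
            posterior[k] /= total

    return predict, update


def run_on_sequence(xs, params, weights):
    """Feed the observations xs to the mixture and return its final prediction."""
    predict, update = make_mixture_predictor(params, weights)
    for x in xs:
        update(x)
    return predict()
```

For example, with `params = [0.1, 0.2, ..., 0.9]`, a uniform prior, and a long binary sequence in which 80% of the symbols are 1, the posterior concentrates on the component with p = 0.8, so the predicted probability of a 1 approaches 0.8. The general result in the abstract concerns arbitrary classes C; this sketch only shows the mechanics of the convex combination.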
- Publication: arXiv e-prints
- Pub Date: August 2014
- DOI: 10.48550/arXiv.1408.2036
- arXiv: arXiv:1408.2036
- Bibcode: 2014arXiv1408.2036R
- Keywords: Computer Science - Machine Learning; Statistics - Machine Learning
- E-Print: This is a duplicate submission of 0905.4341, made by UAI foundation who had the brilliant idea of flooding arxiv with UAI papers 5 years after the conference, without checking whether these papers were already submitted to arxiv or at least asking the authors. Great job, UAI! The journal (extended) version appears in JMLR, 11: 581-602, 2010