PAC-Bayesian aggregation and multi-armed bandits
Abstract
This habilitation thesis presents several contributions to (1) the PAC-Bayesian analysis of statistical learning; (2) three aggregation problems: given d functions, how to predict as well as (i) the best of these d functions (model-selection-type aggregation), (ii) the best convex combination of these d functions, or (iii) the best linear combination of these d functions; and (3) multi-armed bandit problems.
- Publication: arXiv e-prints
- Pub Date: November 2010
- arXiv: arXiv:1011.3396
- Bibcode: 2010arXiv1011.3396A
- Keywords: Mathematics - Statistics Theory