Kullback-Leibler aggregation and misspecified generalized linear models
Abstract
In a regression setup with deterministic design, we study the pure aggregation problem and introduce a natural extension from the Gaussian distribution to distributions in the exponential family. While this extension bears strong connections with generalized linear models, it does not require identifiability of the parameter or even that the model on the systematic component is true. It is shown that this problem can be solved by constrained and/or penalized likelihood maximization and we derive sharp oracle inequalities that hold both in expectation and with high probability. Finally, all the bounds are proved to be optimal in a minimax sense.
- Publication:
- arXiv e-prints
- Pub Date:
- November 2009
- DOI:
- 10.48550/arXiv.0911.2919
- arXiv:
- arXiv:0911.2919
- Bibcode:
- 2009arXiv0911.2919R
- Keywords:
- Statistics - Machine Learning;
- Mathematics - Statistics Theory
- E-Print:
- Published in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org) at http://dx.doi.org/10.1214/11-AOS961