The log-linear group-lasso estimator and its asymptotic properties
Abstract
We define the group-lasso estimator for the natural parameters of the exponential families of distributions representing hierarchical log-linear models under multinomial sampling scheme. Such estimator arises as the solution of a convex penalized likelihood optimization problem based on the group-lasso penalty. We illustrate how it is possible to construct an estimator of the underlying log-linear model using the blocks of nonzero coefficients recovered by the group-lasso procedure. We investigate the asymptotic properties of the group-lasso estimator as a model selection method in a double-asymptotic framework, in which both the sample size and the model complexity grow simultaneously. We provide conditions guaranteeing that the group-lasso estimator is model selection consistent, in the sense that, with overwhelming probability as the sample size increases, it correctly identifies all the sets of nonzero interactions among the variables. Provided the sequences of true underlying models is sparse enough, recovery is possible even if the number of cells grows larger than the sample size. Finally, we derive some central limit type of results for the log-linear group-lasso estimator.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2007
- DOI:
- 10.48550/arXiv.0709.3526
- arXiv:
- arXiv:0709.3526
- Bibcode:
- 2007arXiv0709.3526N
- Keywords:
-
- Mathematics - Statistics Theory
- E-Print:
- Published in at http://dx.doi.org/10.3150/11-BEJ364 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)