Complexities of convex combinations and bounding the generalization error in classification
Abstract
We introduce and study several measures of complexity of functions from the convex hull of a given base class. These complexity measures take into account the sparsity of the weights of a convex combination as well as certain clustering properties of the base functions involved in it. We prove new upper confidence bounds on the generalization error of ensemble (voting) classification algorithms that utilize the new complexity measures along with the empirical distributions of classification margins, providing a better explanation of generalization performance of large margin classification methods.
- Publication:
-
arXiv Mathematics e-prints
- Pub Date:
- May 2004
- DOI:
- arXiv:
- arXiv:math/0405356
- Bibcode:
- 2004math......5356K
- Keywords:
-
- Mathematics - Probability;
- 62G05 (Primary) 62G20;
- 60F15 (Secondary)
- E-Print:
- Published at http://dx.doi.org/10.1214/009053605000000228 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)