Fast Rates by Transferring from Auxiliary Hypotheses

doi:10.48550/arXiv.1412.1619

In this work we consider the learning setting where, in addition to the training set, the learner receives a collection of auxiliary hypotheses originating from other tasks. We focus on a broad class of ERM-based linear algorithms that can be instantiated with any non-negative smooth loss function and any strongly convex regularizer. We establish generalization and excess risk bounds, showing that, if the algorithm is fed a good combination of source hypotheses, generalization happens at the fast rate $\mathcal{O}(1/m)$ instead of the usual $\mathcal{O}(1/\sqrt{m})$, where $m$ is the training set size. On the other hand, if the combination of source hypotheses is a misfit for the target task, we recover the usual learning rate. As a byproduct of our study, we also prove a new bound on the Rademacher complexity of the smooth loss class under weaker assumptions than in previous work.
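To make the setting in the abstract concrete, the following is a minimal sketch of one possible member of the described algorithm class. It is an illustration under stated assumptions, not the paper's exact algorithm: the loss is taken to be the squared loss (non-negative and smooth), the regularizer is the squared L2 norm (strongly convex), and the combination weights `beta` of the source hypotheses are assumed to be given and fixed. The function names `fit_with_source_hypotheses` and `predict` are hypothetical.

```python
import numpy as np


def fit_with_source_hypotheses(X, y, source_preds, beta, lam):
    """Regularized ERM for a linear predictor on top of fixed source hypotheses.

    Assumed (illustrative) setup, not taken from the record above:
      X            : (m, d) target training inputs
      y            : (m,)   target labels
      source_preds : (m, n) outputs of n source hypotheses on the rows of X
      beta         : (n,)   fixed combination weights for the source hypotheses
      lam          : float  regularization strength, lam > 0

    Solves  min_w (1/m) * ||X w + source_preds beta - y||^2 + lam * ||w||^2.
    """
    m, d = X.shape
    # What the source combination fails to explain on the target sample.
    residual = y - source_preds @ beta
    # Closed-form minimizer: (X^T X / m + lam I) w = X^T residual / m.
    A = X.T @ X / m + lam * np.eye(d)
    b = X.T @ residual / m
    return np.linalg.solve(A, b)


def predict(X, source_preds, beta, w):
    """Target prediction: linear correction plus the source combination."""
    return X @ w + source_preds @ beta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 4))
    S = rng.normal(size=(50, 3))          # hypothetical source outputs
    beta = np.array([1.0, -0.5, 0.25])
    y = S @ beta + 0.1 * X[:, 0]          # sources explain most of the target
    w = fit_with_source_hypotheses(X, y, S, beta, lam=0.1)
    print(np.round(w, 3))
```

In this sketch, when the source combination already fits the target labels, the residual vanishes and the learned correction `w` is zero, which loosely mirrors the "good combination" regime in the abstract. When the combination is uninformative (for example `beta` set to zero), the procedure reduces to ordinary ridge regression on the target data.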

Publication:

arXiv e-prints

Pub Date:

December 2014

DOI:

10.48550/arXiv.1412.1619

arXiv:

arXiv:1412.1619

Bibcode:

2014arXiv1412.1619K

Keywords:

Computer Science - Machine Learning

E-Print:

doi:10.1007/s10994-016-5594-4
