Gradient Descent in RKHS with Importance Labeling

doi:10.48550/arXiv.2006.10925

Gradient Descent in RKHS with Importance Labeling

Labeling cost is often expensive and is a fundamental limitation of supervised learning. In this paper, we study importance labeling problem, in which we are given many unlabeled data and select a limited number of data to be labeled from the unlabeled data, and then a learning algorithm is executed on the selected one. We propose a new importance labeling scheme that can effectively select an informative subset of unlabeled data in least squares regression in Reproducing Kernel Hilbert Spaces (RKHS). We analyze the generalization error of gradient descent combined with our labeling scheme and show that the proposed algorithm achieves the optimal rate of convergence in much wider settings and especially gives much better generalization ability in a small label noise setting than the usual uniform sampling scheme. Numerical experiments verify our theoretical findings.

Publication:

arXiv e-prints

Pub Date:

June 2020

DOI:

10.48550/arXiv.2006.10925

arXiv:

arXiv:2006.10925

Bibcode:

2020arXiv200610925M

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

18 pages, 14 figures

NASA/ADS

Gradient Descent in RKHS with Importance Labeling

Abstract