An Optimization Framework for Semi-Supervised and Transfer Learning using Multiple Classifiers and Clusterers

doi:10.48550/arXiv.1206.0994

An Optimization Framework for Semi-Supervised and Transfer Learning using Multiple Classifiers and Clusterers

Unsupervised models can provide supplementary soft constraints to help classify new, "target" data since similar instances in the target set are more likely to share the same class label. Such models can also help detect possible differences between training and target distributions, which is useful in applications where concept drift may take place, as in transfer learning settings. This paper describes a general optimization framework that takes as input class membership estimates from existing classifiers learnt on previously encountered "source" data, as well as a similarity matrix from a cluster ensemble operating solely on the target data to be classified, and yields a consensus labeling of the target data. This framework admits a wide range of loss functions and classification/clustering methods. It exploits properties of Bregman divergences in conjunction with Legendre duality to yield a principled and scalable approach. A variety of experiments show that the proposed framework can yield results substantially superior to those provided by popular transductive learning techniques or by naively applying classifiers learnt on the original task to the target data.

Publication:

arXiv e-prints

Pub Date:

April 2012

DOI:

10.48550/arXiv.1206.0994

arXiv:

arXiv:1206.0994

Bibcode:

2012arXiv1206.0994A

Keywords:

Computer Science - Machine Learning;
I.5.2;
I.5.3;
I.5.4

NASA/ADS

An Optimization Framework for Semi-Supervised and Transfer Learning using Multiple Classifiers and Clusterers

Abstract