Supervised Collective Classification for Crowdsourcing

doi:10.48550/arXiv.1507.06682

Supervised Collective Classification for Crowdsourcing

Crowdsourcing utilizes the wisdom of crowds for collective classification via information (e.g., labels of an item) provided by labelers. Current crowdsourcing algorithms are mainly unsupervised methods that are unaware of the quality of crowdsourced data. In this paper, we propose a supervised collective classification algorithm that aims to identify reliable labelers from the training data (e.g., items with known labels). The reliability (i.e., weighting factor) of each labeler is determined via a saddle point algorithm. The results on several crowdsourced data show that supervised methods can achieve better classification accuracy than unsupervised methods, and our proposed method outperforms other algorithms.

Publication:

arXiv e-prints

Pub Date:

July 2015

DOI:

10.48550/arXiv.1507.06682

arXiv:

arXiv:1507.06682

Bibcode:

2015arXiv150706682C

Keywords:

Computer Science - Social and Information Networks;
Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

to appear in IEEE Global Communications Conference (GLOBECOM) Workshop on Networking and Collaboration Issues for the Internet of Everything

NASA/ADS

Supervised Collective Classification for Crowdsourcing

Abstract