Collecting Image Description Datasets using Crowdsourcing

doi:10.48550/arXiv.1411.3041

Collecting Image Description Datasets using Crowdsourcing

We describe our two new datasets with images described by humans. Both the datasets were collected using Amazon Mechanical Turk, a crowdsourcing platform. The two datasets contain significantly more descriptions per image than other existing datasets. One is based on a popular image description dataset called the UIUC Pascal Sentence Dataset, whereas the other is based on the Abstract Scenes dataset con- taining images made from clipart objects. In this paper we describe our interfaces, analyze some properties of and show example descriptions from our two datasets.

Publication:

arXiv e-prints

Pub Date:

November 2014

DOI:

10.48550/arXiv.1411.3041

arXiv:

arXiv:1411.3041

Bibcode:

2014arXiv1411.3041V

Keywords:

Computer Science - Computer Vision and Pattern Recognition

NASA/ADS

Collecting Image Description Datasets using Crowdsourcing

Abstract