Collecting Image Description Datasets using Crowdsourcing
Abstract
We describe two new datasets of images described by humans. Both datasets were collected using Amazon Mechanical Turk, a crowdsourcing platform, and both contain significantly more descriptions per image than other existing datasets. One is based on a popular image description dataset, the UIUC Pascal Sentence Dataset, whereas the other is based on the Abstract Scenes dataset, which contains images made from clipart objects. In this paper we describe our interfaces, analyze some properties of the two datasets, and show example descriptions from each.
- Publication: arXiv e-prints
- Pub Date: November 2014
- DOI: 10.48550/arXiv.1411.3041
- arXiv: arXiv:1411.3041
- Bibcode: 2014arXiv1411.3041V
- Keywords: Computer Science - Computer Vision and Pattern Recognition