VISALOGY: Answering Visual Analogy Questions
Abstract
In this paper, we study the problem of answering visual analogy questions. These questions take the form of image A is to image B as image C is to what. Answering these questions entails discovering the mapping from image A to image B and then extending the mapping to image C and searching for the image D such that the relation from A to B holds for C to D. We pose this problem as learning an embedding that encourages pairs of analogous images with similar transformations to be close together using convolutional neural networks with a quadruple Siamese architecture. We introduce a dataset of visual analogy questions in natural images, and show first results of its kind on solving analogy questions on natural images.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2015
- DOI:
- 10.48550/arXiv.1510.08973
- arXiv:
- arXiv:1510.08973
- Bibcode:
- 2015arXiv151008973S
- Keywords:
-
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- To appear in NIPS 2015