Slot Contrastive Networks: A Contrastive Approach for Representing Objects

doi:10.48550/arXiv.2007.09294

Unsupervised extraction of objects from low-level visual data is an important goal for further progress in machine learning. Existing approaches for representing objects without labels use structured generative models on static images. These methods spend much of their capacity reconstructing unimportant background pixels and, as a result, miss low-contrast or small objects. In contrast, we present a new method that avoids losses in pixel space and over-reliance on the limited signal a static image provides. Our approach takes advantage of objects' motion by learning with a discriminative, time-contrastive loss in the space of slot representations, encouraging each slot not only to capture entities that move, but also to capture objects distinct from those captured by the other slots. Moreover, we introduce a new quantitative evaluation metric to measure how "diverse" a set of slot vectors is, and use it to evaluate our model on 20 Atari games.
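To make the idea of a time-contrastive loss over slots more concrete, here is a minimal sketch in plain Python. It is an assumption-laden illustration, not the paper's objective: it uses a generic InfoNCE-style loss with dot-product scores, and the function name `slot_time_contrastive_loss` and the toy slot vectors are hypothetical. Each slot at time t is treated as an anchor, the same slot index at the next time step as its positive, and the other slots at the next time step as negatives.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def slot_time_contrastive_loss(slots_t, slots_next):
    """Illustrative InfoNCE-style loss over slot vectors (an assumption,
    not the exact loss from the paper).

    slots_t, slots_next: lists of K slot vectors at consecutive time steps.
    For slot k at time t, the positive is slot k at the next step and the
    negatives are the other K - 1 slots at the next step.
    """
    assert len(slots_t) == len(slots_next)
    total = 0.0
    for k, anchor in enumerate(slots_t):
        scores = [dot(anchor, other) for other in slots_next]
        # Numerically stable log-sum-exp over all candidate slots.
        m = max(scores)
        log_norm = m + math.log(sum(math.exp(s - m) for s in scores))
        # Negative log-probability of picking the matching slot.
        total += log_norm - scores[k]
    return total / len(slots_t)


# Toy data (hypothetical): two slots with two-dimensional vectors.
distinct_t = [[1.0, 0.0], [0.0, 1.0]]
distinct_next = [[0.9, 0.1], [0.1, 0.9]]
collapsed_t = [[1.0, 0.0], [1.0, 0.0]]
collapsed_next = [[0.9, 0.1], [0.9, 0.1]]

loss_distinct = slot_time_contrastive_loss(distinct_t, distinct_next)
loss_collapsed = slot_time_contrastive_loss(collapsed_t, collapsed_next)
```

In this toy setting the loss is lower when the two slots hold different vectors than when they collapse onto the same vector, which mirrors the stated goal of pushing slots to represent distinct objects while staying consistent over time.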

Publication: arXiv e-prints

Pub Date: July 2020

DOI: 10.48550/arXiv.2007.09294

arXiv: arXiv:2007.09294

Bibcode: 2020arXiv200709294R

Keywords: Computer Science - Machine Learning; Computer Science - Computer Vision and Pattern Recognition; Statistics - Machine Learning

E-Print: Presented at ICML 2020 Workshop: Object-Oriented Learning (OOL): Perception, Representation, and Reasoning
