What am I Searching for: Zero-shot Target Identity Inference in Visual Search

doi:10.48550/arXiv.2005.12741

Can we infer intentions from a person's actions? As an example problem, here we consider how to decipher what a person is searching for by decoding their eye movement behavior. We conducted two psychophysics experiments where we monitored eye movements while subjects searched for a target object. We defined the fixations falling on non-target objects as "error fixations". Using those error fixations, we developed a model (InferNet) to infer what the target was. InferNet uses a pre-trained convolutional neural network to extract features from the error fixations and computes a similarity map between the error fixations and all locations across the search image. The model consolidates the similarity maps across layers and integrates these maps across all error fixations. InferNet successfully identifies the subject's goal and outperforms competitive null models, even without any object-specific training on the inference task.
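
The pipeline described in the abstract (features at error fixations, a similarity map per layer, consolidation across layers, integration across fixations) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. Everything in it is an assumption made for illustration: the function names `similarity_map` and `infer_target_map` are hypothetical, random arrays stand in for the feature maps of a pre-trained network, all layers are assumed to have been resized to one spatial grid, cosine similarity is assumed as the similarity measure, and plain averaging is assumed for both the consolidation and the integration steps.

```python
import numpy as np

def similarity_map(feature_map, fix_rc):
    """Cosine similarity between the feature vector at one fixation
    and the feature vector at every location (assumed measure).

    feature_map: array of shape (H, W, C)
    fix_rc: (row, col) of the error fixation on the H x W grid
    """
    h, w, c = feature_map.shape
    flat = feature_map.reshape(-1, c)
    norms = np.linalg.norm(flat, axis=1, keepdims=True)
    unit = flat / np.maximum(norms, 1e-12)  # guard against zero vectors
    query = unit[fix_rc[0] * w + fix_rc[1]]
    return (unit @ query).reshape(h, w)

def infer_target_map(layer_maps, fixations):
    """Average similarity maps across layers, then across error fixations.

    layer_maps: list of (H, W, C_l) arrays sharing the same H and W
    fixations: list of (row, col) error fixations
    """
    total = np.zeros(layer_maps[0].shape[:2])
    for fix in fixations:
        per_layer = [similarity_map(fm, fix) for fm in layer_maps]
        total += np.mean(per_layer, axis=0)  # consolidate across layers
    return total / len(fixations)            # integrate across fixations

# Hypothetical usage: random stand-ins for two CNN layers on an 8 x 8 grid.
rng = np.random.default_rng(0)
layers = [rng.normal(size=(8, 8, 16)), rng.normal(size=(8, 8, 32))]
score = infer_target_map(layers, [(1, 1), (4, 2)])
guess = np.unravel_index(np.argmax(score), score.shape)
```

Note that in this sketch each fixated location is maximally similar to itself, so a practical variant would likely mask out already-fixated locations before taking the highest-scoring location as the inferred target; whether and how the original model does this is not stated in the abstract.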

Publication:

arXiv e-prints

Pub Date:

May 2020

DOI:

10.48550/arXiv.2005.12741

arXiv:

arXiv:2005.12741

Bibcode:

2020arXiv200512741Z

Keywords:

Computer Science - Computer Vision and Pattern Recognition;
Computer Science - Artificial Intelligence;
Computer Science - Machine Learning

E-Print:

This was a mistaken new submission; it is a pointer to arXiv:1807.11926.

NASA/ADS
