SEMBED: Semantic Embedding of Egocentric Action Videos

doi:10.48550/arXiv.1607.08414

We present SEMBED, an approach for embedding an egocentric object interaction video in a semantic-visual graph to estimate the probability distribution over its potential semantic labels. When object interactions are annotated using an unbounded choice of verbs, we embrace the wealth and ambiguity of these labels by capturing both the semantic relationships and the visual similarities over motion and appearance features. We show how SEMBED can interpret a challenging dataset of 1225 freely annotated egocentric videos, outperforming SVM classification by more than 5%.

Publication:

arXiv e-prints

Pub Date:

July 2016

DOI:

10.48550/arXiv.1607.08414

arXiv:

arXiv:1607.08414

Bibcode:

2016arXiv160708414W

Keywords:

Computer Science - Computer Vision and Pattern Recognition
