Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022
Abstract
In this report, we present our approach and empirical results for applying masked autoencoders to two egocentric video understanding tasks in the Ego4D Challenge 2022: Object State Change Classification and PNR Temporal Localization. As team TheSSVL, we ranked 2nd in both tasks. Our code will be made available.
- Publication: arXiv e-prints
- Pub Date: November 2022
- DOI: 10.48550/arXiv.2211.15286
- arXiv: arXiv:2211.15286
- Bibcode: 2022arXiv221115286L
- Keywords: Computer Science - Computer Vision and Pattern Recognition
- E-Print: 5 pages