Analysing object detectors from the perspective of co-occurring object categories
Abstract
The accuracy of state-of-the-art Faster R-CNN and YOLO object detectors are evaluated and compared on a special masked MS COCO dataset to measure how much their predictions rely on contextual information encoded at object category level. Category level representation of context is motivated by the fact that it could be an adequate way to transfer knowledge between visual and non-visual domains. According to our measurements, current detectors usually do not build strong dependency on contextual information at category level, however, when they does, they does it in a similar way, suggesting that contextual dependence of object categories is an independent property that is relevant to be transferred.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2018
- DOI:
- 10.48550/arXiv.1809.08132
- arXiv:
- arXiv:1809.08132
- Bibcode:
- 2018arXiv180908132N
- Keywords:
-
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- accepted to 9th IEEE International Conference on Cognitive InfoCommunications