Part-based R-CNNs for Fine-grained Category Detection
Abstract
Semantic part localization can facilitate fine-grained categorization by explicitly isolating subtle appearance differences associated with specific object parts. Methods for pose-normalized representations have been proposed, but generally presume bounding box annotations at test time due to the difficulty of object detection. We propose a model for fine-grained categorization that overcomes these limitations by leveraging deep convolutional features computed on bottom-up region proposals. Our method learns whole-object and part detectors, enforces learned geometric constraints between them, and predicts a fine-grained category from a pose-normalized representation. Experiments on the Caltech-UCSD bird dataset confirm that our method outperforms state-of-the-art fine-grained categorization methods in an end-to-end evaluation without requiring a bounding box at test time.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2014
- DOI:
- 10.48550/arXiv.1407.3867
- arXiv:
- arXiv:1407.3867
- Bibcode:
- 2014arXiv1407.3867Z
- Keywords:
-
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- 16 pages. To appear at European Conference on Computer Vision (ECCV), 2014