A similarity learning for fine-grained images based on the Mahalanobis metric and the kernel method
Abstract
Because most prior studies on similar image retrieval have focused on the category level, image similarity learning at the fine-grained level remains a challenge, which often leads to a semantic gap between low-level visual features and high-level human perception. To address this problem, we propose a Mahalanobis and kernel-based similarity (Mah-Ker) method combined with features learned by a Convolutional Neural Network (CNN). First, triplet constraints are introduced to characterize the fine-grained image similarity relationships on which the Mahalanobis metric is trained. Then a kernel-based metric is introduced in the last layer of the model to provide a nonlinear extension of the Mahalanobis metric and further improve performance. Experiments on the real-world VIP.com dress dataset show that the proposed method achieves higher retrieval performance than both the state-of-the-art fine-grained similarity model and approaches based on hand-crafted visual features.
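The abstract does not give the formulas behind the method, so the following is only a minimal sketch of the generic building blocks it names: a Mahalanobis distance parameterized as M = LᵀL, a hinge loss over (anchor, positive, negative) triplets, and a kernel-induced distance as one common way to obtain a nonlinear extension. All function names, the margin value, and the choice of an RBF kernel are assumptions made for illustration and are not taken from the paper; plain vectors stand in for CNN features.

```python
import numpy as np


def mahalanobis_sq(x, y, L):
    """Squared Mahalanobis distance (x - y)^T M (x - y) with M = L^T L.

    Factoring M through L keeps it positive semidefinite by construction.
    """
    diff = L @ (x - y)
    return float(diff @ diff)


def triplet_hinge_loss(anchor, positive, negative, L, margin=1.0):
    """Hinge loss for one triplet (hypothetical formulation).

    The loss is zero once the negative is farther from the anchor than the
    positive by at least `margin`, measured in squared Mahalanobis distance.
    """
    d_pos = mahalanobis_sq(anchor, positive, L)
    d_neg = mahalanobis_sq(anchor, negative, L)
    return max(0.0, margin + d_pos - d_neg)


def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel; the kernel choice here is an assumption."""
    return float(np.exp(-gamma * np.sum((x - y) ** 2)))


def kernel_distance_sq(x, y, kernel=rbf_kernel):
    """Squared distance in the feature space induced by `kernel`."""
    return kernel(x, x) - 2.0 * kernel(x, y) + kernel(y, y)
```

With the identity matrix for `L`, `mahalanobis_sq` reduces to the squared Euclidean distance; training would adjust `L` so that triplets violating the margin incur less loss. How the paper actually combines the kernel with the learned metric is not stated in the abstract.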
- Publication: Third International Workshop on Pattern Recognition
- Pub Date: July 2018
- DOI: 10.1117/12.2501757
- Bibcode: 2018SPIE10828E..15F