Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks

doi:10.48550/arXiv.2408.16445

Three-dimensional (3D) reconstruction from two-dimensional images is an active research field in computer vision, with applications ranging from navigation and object tracking to segmentation and three-dimensional modeling. Traditionally, parametric techniques have been employed for this task, but recent work has shifted towards learning-based methods. Given the rapid pace of research and the frequent introduction of new image matching methods, it is essential to evaluate them. In this paper, we present a comprehensive evaluation of various image matching methods using a structure-from-motion pipeline. We assess the performance of these methods on both in-domain and out-of-domain datasets, identifying key limitations in both the methods and the benchmarks. We also investigate the impact of edge detection as a pre-processing step. Our analysis reveals that image matching for 3D reconstruction remains an open challenge that requires careful selection and tuning of models for specific scenarios, and it highlights mismatches between what current metrics report and how the methods actually perform.

Publication: arXiv e-prints

Pub Date: August 2024

DOI: 10.48550/arXiv.2408.16445

arXiv: arXiv:2408.16445

Bibcode: 2024arXiv240816445B

Keywords: Computer Science - Computer Vision and Pattern Recognition

E-Print: 19 pages, 5 figures
