Decreasing Annotation Burden of Pairwise Comparisons with Human-in-the-Loop Sorting: Application in Medical Image Artifact Rating

doi:10.48550/arXiv.2202.04823

Decreasing Annotation Burden of Pairwise Comparisons with Human-in-the-Loop Sorting: Application in Medical Image Artifact Rating

Ranking by pairwise comparisons has shown improved reliability over ordinal classification. However, as the annotations of pairwise comparisons scale quadratically, this becomes less practical when the dataset is large. We propose a method for reducing the number of pairwise comparisons required to rank by a quantitative metric, demonstrating the effectiveness of the approach in ranking medical images by image quality in this proof of concept study. Using the medical image annotation software that we developed, we actively subsample pairwise comparisons using a sorting algorithm with a human rater in the loop. We find that this method substantially reduces the number of comparisons required for a full ordinal ranking without compromising inter-rater reliability when compared to pairwise comparisons without sorting.

Publication:

arXiv e-prints

Pub Date:

February 2022

DOI:

10.48550/arXiv.2202.04823

arXiv:

arXiv:2202.04823

Bibcode:

2022arXiv220204823J

Keywords:

Quantitative Biology - Quantitative Methods;
Computer Science - Computer Vision and Pattern Recognition;
Computer Science - Machine Learning;
Electrical Engineering and Systems Science - Image and Video Processing;
I.2.1

E-Print:

5 pages, 2 figures, NeurIPS Data-Centric AI Workshop 2021

NASA/ADS

Decreasing Annotation Burden of Pairwise Comparisons with Human-in-the-Loop Sorting: Application in Medical Image Artifact Rating

Abstract