Estimating the Arc Length of the Optimal ROC Curve and Lower Bounding the Maximal AUC
Abstract
In this paper, we show the arc length of the optimal ROC curve is an $f$-divergence. By leveraging this result, we express the arc length using a variational objective and estimate it accurately using positive and negative samples. We show this estimator has a non-parametric convergence rate $O_p(n^{-\beta/4})$ ($\beta \in (0,1]$ depends on the smoothness). Using the same technique, we show the surface area between the optimal ROC curve and the diagonal can be expressed via a similar variational objective. These new insights lead to a novel classification procedure that maximizes an approximate lower bound of the maximal AUC. Experiments on CIFAR-10 datasets show the proposed two-step procedure achieves good AUC performance in imbalanced binary classification tasks.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2021
- DOI:
- 10.48550/arXiv.2110.09651
- arXiv:
- arXiv:2110.09651
- Bibcode:
- 2021arXiv211009651L
- Keywords:
-
- Mathematics - Statistics Theory;
- Statistics - Machine Learning
- E-Print:
- Camera-ready for NeurIPS 2022