Learning to Approximate a Bregman Divergence

doi:10.48550/arXiv.1905.11545

Learning to Approximate a Bregman Divergence

Bregman divergences generalize measures such as the squared Euclidean distance and the KL divergence, and arise throughout many areas of machine learning. In this paper, we focus on the problem of approximating an arbitrary Bregman divergence from supervision, and we provide a well-principled approach to analyzing such approximations. We develop a formulation and algorithm for learning arbitrary Bregman divergences based on approximating their underlying convex generating function via a piecewise linear function. We provide theoretical approximation bounds using our parameterization and show that the generalization error $O_p(m^{-1/2})$ for metric learning using our framework matches the known generalization error in the strictly less general Mahalanobis metric learning setting. We further demonstrate empirically that our method performs well in comparison to existing metric learning methods, particularly for clustering and ranking problems.

Publication:

arXiv e-prints

Pub Date:

May 2019

DOI:

10.48550/arXiv.1905.11545

arXiv:

arXiv:1905.11545

Bibcode:

2019arXiv190511545S

Keywords:

Statistics - Machine Learning;
Computer Science - Machine Learning

E-Print:

19 pages, 4 figures

NASA/ADS

Learning to Approximate a Bregman Divergence

Abstract