Clustering from Sparse Pairwise Measurements
Abstract
We consider the problem of grouping items into clusters based on few random pairwise comparisons between the items. We introduce three closely related algorithms for this task: a belief propagation algorithm approximating the Bayes-optimal solution, and two spectral algorithms based on the non-backtracking and Bethe Hessian operators. For the case of two symmetric clusters, we conjecture that these algorithms are asymptotically optimal, in that they detect the clusters as soon as it is information-theoretically possible to do so. We substantiate this claim for one of the spectral approaches we introduce.
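To give a concrete feel for the Bethe Hessian spectral approach named in the abstract, here is a minimal sketch for two symmetric clusters. It is an illustration under stated assumptions, not the paper's algorithm: it uses the plain unweighted form H(r) = (r^2 - 1) I - r A + D on a 0/1 graph with r set to the square root of the mean degree, whereas the paper works with general pairwise measurements. The planted-graph generator, the function names, and the parameter values are all hypothetical choices made for the example.

```python
import numpy as np

def planted_two_groups(n, p_in, p_out, rng):
    """Symmetric 0/1 adjacency matrix with two equal planted groups (labels +1/-1)."""
    labels = np.where(np.arange(n) < n // 2, 1, -1)
    same = labels[:, None] == labels[None, :]
    probs = np.where(same, p_in, p_out)
    # Sample each pair once (strict upper triangle), then symmetrize.
    upper = np.triu(rng.random((n, n)) < probs, k=1)
    adj = (upper | upper.T).astype(float)
    return adj, labels

def bethe_hessian_labels(adj):
    """Two-way split from the unweighted Bethe Hessian (assumed form, see lead-in)."""
    degrees = adj.sum(axis=1)
    r = np.sqrt(degrees.mean())  # assumption: r = sqrt(mean degree)
    h = (r**2 - 1.0) * np.eye(len(adj)) - r * adj + np.diag(degrees)
    _, vecs = np.linalg.eigh(h)  # eigenvalues returned in ascending order
    # The eigenvector of the second-smallest eigenvalue carries the split.
    return np.where(vecs[:, 1] >= 0, 1, -1)

def overlap(a, b):
    """Fraction of agreeing labels, up to a global sign flip."""
    agree = np.mean(a == b)
    return max(agree, 1.0 - agree)

rng = np.random.default_rng(0)
adj, truth = planted_two_groups(400, 0.10, 0.01, rng)
guess = bethe_hessian_labels(adj)
print(overlap(truth, guess))
```

The example deliberately uses a strongly separated graph, far from the detectability threshold the abstract refers to, so the sign of a single eigenvector is enough to recover the planted groups. A dense eigendecomposition keeps the sketch short; for large sparse inputs a sparse eigensolver would be the more natural choice.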
- Publication: arXiv e-prints
- Pub Date: January 2016
- DOI: 10.48550/arXiv.1601.06683
- arXiv: arXiv:1601.06683
- Bibcode: 2016arXiv160106683S
- Keywords: Computer Science - Social and Information Networks; Condensed Matter - Disordered Systems and Neural Networks; Computer Science - Machine Learning
- E-Print: Proceedings of the 2016 IEEE International Symposium on Information Theory (ISIT), pages 780-784