Scalable Bayesian Modelling of Paired Symbols
Abstract
We present a novel, scalable and Bayesian approach to modelling the occurrence of pairs of symbols (i,j) drawn from a large vocabulary. Observed pairs are assumed to be generated by a simple popularity-based selection process followed by censoring using a preference function. By basing inference on the well-founded principle of variational bounding, and using new site-independent bounds, we show how a scalable inference procedure can be obtained for large data sets. State-of-the-art results are presented on real-world movie viewing data.
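The two-stage generative process described in the abstract (popularity-based selection, then censoring by a preference function) can be illustrated with a minimal simulation. This is only a sketch under stated assumptions, not the paper's model: the logistic preference function on an inner product of latent vectors, the names `sample_pairs`, `U`, `V` and `pop`, and the Dirichlet-distributed popularities are all hypothetical choices made for illustration.

```python
import numpy as np


def sample_pairs(pop_i, pop_j, U, V, n_pairs, rng):
    """Simulate observed pairs: propose (i, j) by popularity, then censor.

    Assumption: the preference function is a logistic function of the
    inner product of latent vectors U[i] and V[j]. The abstract does not
    specify its form.
    """
    pairs = []
    while len(pairs) < n_pairs:
        # Stage 1: popularity-based selection of each symbol.
        i = int(rng.choice(len(pop_i), p=pop_i))
        j = int(rng.choice(len(pop_j), p=pop_j))
        # Stage 2: censoring; the pair is observed only if it is accepted.
        accept_prob = 1.0 / (1.0 + np.exp(-(U[i] @ V[j])))
        if rng.random() < accept_prob:
            pairs.append((i, j))
    return pairs


rng = np.random.default_rng(0)
n_symbols, dim = 50, 3
pop = rng.dirichlet(np.ones(n_symbols))   # hypothetical popularity distribution
U = rng.normal(size=(n_symbols, dim))     # hypothetical latent vectors for i
V = rng.normal(size=(n_symbols, dim))     # hypothetical latent vectors for j
observed = sample_pairs(pop, pop, U, V, n_pairs=200, rng=rng)
```

Only the accepted pairs in `observed` would be visible to an inference procedure; the rejected proposals are never recorded, which is what makes the data censored.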
- Publication: arXiv e-prints
- Pub Date: September 2014
- DOI: 10.48550/arXiv.1409.2824
- arXiv: arXiv:1409.2824
- Bibcode: 2014arXiv1409.2824P
- Keywords: Statistics - Machine Learning
- E-Print: 15 pages, 6 figures