Learning an arbitrary mixture of two multinomial logits
Abstract
In this paper, we consider mixtures of multinomial logistic models (MNLs), which are known to $\epsilon$-approximate any random utility model. Despite their long history and broad use, rigorous results are only available for learning a uniform mixture of two MNLs. Continuing this line of research, we study the problem of learning an arbitrary mixture of two MNLs. We show that the identifiability of the mixture models may only fail on an algebraic variety of negligible measure. This is done by reducing the problem of learning a mixture of two MNLs to the problem of solving a system of univariate quartic equations. We also devise an algorithm to learn any mixture of two MNLs using a polynomial number of samples and a linear number of queries, provided that a mixture of two MNLs over some finite universe is identifiable. Several numerical experiments and conjectures are also presented.
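To make the object of study concrete, the following minimal sketch computes choice probabilities under a mixture of two MNLs using the standard MNL rule (an item is chosen from an offered slate with probability proportional to its weight). It does not reproduce the paper's learning algorithm. The item names, weights, mixing probability, and function names are hypothetical and chosen only for illustration.

```python
from fractions import Fraction


def mnl_choice_probs(weights, slate):
    """Choice probabilities of a single MNL restricted to the offered slate.

    Item i is chosen with probability weights[i] / sum of weights over the slate.
    """
    total = sum(weights[i] for i in slate)
    return {i: weights[i] / total for i in slate}


def mixture_choice_probs(mix, weights_a, weights_b, slate):
    """Choice probabilities of a two-component MNL mixture.

    With probability `mix` the choice follows component A, otherwise component B.
    """
    probs_a = mnl_choice_probs(weights_a, slate)
    probs_b = mnl_choice_probs(weights_b, slate)
    return {i: mix * probs_a[i] + (1 - mix) * probs_b[i] for i in slate}


# Hypothetical three-item universe; exact rationals avoid rounding noise.
weights_a = {"x": Fraction(1), "y": Fraction(2), "z": Fraction(3)}
weights_b = {"x": Fraction(3), "y": Fraction(1), "z": Fraction(1)}
mix = Fraction(1, 4)  # non-uniform mixing probability (assumed for illustration)

pair = mixture_choice_probs(mix, weights_a, weights_b, ["x", "y"])
triple = mixture_choice_probs(mix, weights_a, weights_b, ["x", "y", "z"])
```

Here `pair` and `triple` are the distributions a learner would sample from when offering slates of two or three items. Identifiability asks whether such observable distributions determine `mix`, `weights_a`, and `weights_b` (up to scaling of each weight vector and swapping the two components).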
- Publication: arXiv e-prints
- Pub Date: June 2020
- DOI: 10.48550/arXiv.2007.00204
- arXiv: arXiv:2007.00204
- Bibcode: 2020arXiv200700204T
- Keywords: Statistics - Machine Learning; Computer Science - Computational Complexity; Computer Science - Machine Learning; Computer Science - Symbolic Computation
- E-Print: 14 pages