Direct Optimization through $\arg \max$ for Discrete Variational Auto-Encoder

doi:10.48550/arXiv.1806.02867

Direct Optimization through $\arg \max$ for Discrete Variational Auto-Encoder

Reparameterization of variational auto-encoders with continuous random variables is an effective method for reducing the variance of their gradient estimates. In the discrete case, one can perform reparametrization using the Gumbel-Max trick, but the resulting objective relies on an $\arg \max$ operation and is non-differentiable. In contrast to previous works which resort to softmax-based relaxations, we propose to optimize it directly by applying the direct loss minimization approach. Our proposal extends naturally to structured discrete latent variable models when evaluating the $\arg \max$ operation is tractable. We demonstrate empirically the effectiveness of the direct loss minimization technique in variational autoencoders with both unstructured and structured discrete latent variables.

Publication:

arXiv e-prints

Pub Date:

June 2018

DOI:

10.48550/arXiv.1806.02867

arXiv:

arXiv:1806.02867

Bibcode:

2018arXiv180602867L

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

Accepted by Neural Information Processing Systems (NeurIPS 2019)

NASA/ADS

Direct Optimization through $\arg \max$ for Discrete Variational Auto-Encoder

Abstract