Deep Variational Inference Without Pixel-Wise Reconstruction

doi:10.48550/arXiv.1611.05209

Variational autoencoders (VAEs) built upon deep neural networks have emerged as popular generative models in computer vision. Most work on improving VAEs has focused on making the approximations to the posterior flexible and accurate, leading to tremendous progress. However, there have been limited efforts to replace pixel-wise reconstruction, which has known shortcomings. In this work, we use real-valued non-volume preserving transformations (real NVP) to exactly compute the conditional likelihood of the data given the latent distribution. We show that a simple VAE with this form of reconstruction is competitive with complicated VAE structures on image modeling tasks. As part of our model, we develop powerful conditional coupling layers that enable real NVP to learn with fewer intermediate layers.
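To illustrate the idea of an exact conditional likelihood, the following is a minimal NumPy sketch of a single affine coupling layer whose scale and shift depend on both the untouched half of the input and a latent code z. It is not the architecture from the paper: the one-hidden-layer conditioner, the tanh-bounded log-scale, the standard normal base density, and every name (`init_params`, `conditional_coupling_forward`, `conditional_log_likelihood`, the parameter keys) are assumptions made for illustration only.

```python
import numpy as np


def init_params(rng, x_dim, z_dim, hidden_dim):
    """Random weights for a hypothetical one-hidden-layer conditioner."""
    d = x_dim // 2            # size of the half that passes through unchanged
    out_dim = x_dim - d       # size of the half that gets transformed
    return {
        "W1": 0.1 * rng.normal(size=(d + z_dim, hidden_dim)),
        "b1": np.zeros(hidden_dim),
        "Ws": 0.1 * rng.normal(size=(hidden_dim, out_dim)),
        "bs": np.zeros(out_dim),
        "Wt": 0.1 * rng.normal(size=(hidden_dim, out_dim)),
        "bt": np.zeros(out_dim),
    }


def _scale_and_shift(x_a, z, params):
    """Compute log-scale and shift from the untouched half and the latent z."""
    h = np.concatenate([x_a, z], axis=-1)
    hidden = np.tanh(h @ params["W1"] + params["b1"])
    log_scale = np.tanh(hidden @ params["Ws"] + params["bs"])  # bounded in (-1, 1)
    shift = hidden @ params["Wt"] + params["bt"]
    return log_scale, shift


def conditional_coupling_forward(x, z, params):
    """Map x to u given z; also return log|det du/dx| per example."""
    d = x.shape[-1] // 2
    x_a, x_b = x[..., :d], x[..., d:]
    log_scale, shift = _scale_and_shift(x_a, z, params)
    u_b = x_b * np.exp(log_scale) + shift
    # The Jacobian is triangular, so its log-determinant is the sum of log-scales.
    return np.concatenate([x_a, u_b], axis=-1), log_scale.sum(axis=-1)


def conditional_coupling_inverse(u, z, params):
    """Invert the layer exactly; x_a is unchanged, so the conditioner is reusable."""
    d = u.shape[-1] // 2
    u_a, u_b = u[..., :d], u[..., d:]
    log_scale, shift = _scale_and_shift(u_a, z, params)
    x_b = (u_b - shift) * np.exp(-log_scale)
    return np.concatenate([u_a, x_b], axis=-1)


def conditional_log_likelihood(x, z, params):
    """Exact log p(x | z) via change of variables with a standard normal base."""
    u, log_det = conditional_coupling_forward(x, z, params)
    log_base = -0.5 * (u ** 2 + np.log(2.0 * np.pi)).sum(axis=-1)
    return log_base + log_det
```

A practical flow would stack several such layers with alternating partitions so that every dimension is eventually transformed. In a VAE, a term of this kind would take the place of the pixel-wise reconstruction term, with z sampled from the approximate posterior.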

Publication:

arXiv e-prints

Pub Date:

November 2016

arXiv:

arXiv:1611.05209

Bibcode:

2016arXiv161105209A

Keywords:

Statistics - Machine Learning;
Computer Science - Computer Vision and Pattern Recognition;
Computer Science - Machine Learning
