Increasing Expressivity of a Hyperspherical VAE

doi:10.48550/arXiv.1910.02912

Increasing Expressivity of a Hyperspherical VAE

Learning suitable latent representations for observed, high-dimensional data is an important research topic underlying many recent advances in machine learning. While traditionally the Gaussian normal distribution has been the go-to latent parameterization, recently a variety of works have successfully proposed the use of manifold-valued latents. In one such work (Davidson et al., 2018), the authors empirically show the potential benefits of using a hyperspherical von Mises-Fisher (vMF) distribution in low dimensionality. However, due to the unique distributional form of the vMF, expressivity in higher dimensional space is limited as a result of its scalar concentration parameter leading to a 'hyperspherical bottleneck'. In this work we propose to extend the usability of hyperspherical parameterizations to higher dimensions using a product-space instead, showing improved results on a selection of image datasets.

Publication:

arXiv e-prints

Pub Date:

October 2019

DOI:

10.48550/arXiv.1910.02912

arXiv:

arXiv:1910.02912

Bibcode:

2019arXiv191002912D

Keywords:

Statistics - Machine Learning;
Computer Science - Machine Learning

E-Print:

NeurIPS 2019, in Workshop on Bayesian Deep Learning

NASA/ADS

Increasing Expressivity of a Hyperspherical VAE

Abstract