Reparameterizable Subset Sampling via Continuous Relaxations

doi:10.48550/arXiv.1901.10517

Many machine learning tasks require sampling a subset of items from a collection based on a parameterized distribution. The Gumbel-softmax trick can be used to sample a single item, and allows for low-variance reparameterized gradients with respect to the parameters of the underlying distribution. However, stochastic optimization involving subset sampling is typically not reparameterizable. To overcome this limitation, we define a continuous relaxation of subset sampling that provides reparameterization gradients by generalizing the Gumbel-max trick. We use this approach to sample subsets of features in an instance-wise feature selection task for model interpretability, subsets of neighbors to implement a deep stochastic k-nearest neighbors model, and sub-sequences of neighbors to implement parametric t-SNE by directly comparing the identities of local neighbors. We improve performance in all these tasks by incorporating subset sampling in end-to-end training.
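The abstract does not spell out the relaxation itself, so the following is only a minimal NumPy sketch of one way the idea can be realized, under stated assumptions. It perturbs log-weights with Gumbel noise (the Gumbel-max trick), takes the k largest perturbed keys to obtain a hard subset, and replaces that top-k step with k successive temperature-controlled softmaxes to obtain a smooth "k-hot" vector. The function names (`gumbel_keys`, `hard_topk`, `relaxed_topk`), the `temperature` parameter, and the specific successive-softmax update are illustrative assumptions, not necessarily the authors' exact procedure.

```python
import numpy as np

def gumbel_keys(log_weights, rng):
    """Perturb log-weights with i.i.d. Gumbel(0, 1) noise (Gumbel-max trick)."""
    u = rng.uniform(low=1e-12, high=1.0, size=log_weights.shape)
    return log_weights - np.log(-np.log(u))

def hard_topk(keys, k):
    """Exact k-hot indicator of the k largest keys (not differentiable)."""
    idx = np.argsort(-keys)[:k]
    khot = np.zeros_like(keys, dtype=float)
    khot[idx] = 1.0
    return khot

def relaxed_topk(keys, k, temperature):
    """Assumed relaxation: sum of k successive softmaxes, down-weighting
    already-selected items between steps. Entries sum to k."""
    alpha = np.asarray(keys, dtype=float).copy()
    khot = np.zeros_like(alpha)
    for _ in range(k):
        z = alpha / temperature
        z = z - z.max()                      # numerical stability
        p = np.exp(z)
        p /= p.sum()                         # softmax over remaining mass
        khot += p
        # Suppress items in proportion to how strongly they were just picked.
        alpha = alpha + np.log(np.clip(1.0 - p, 1e-12, None))
    return khot

rng = np.random.default_rng(0)
log_w = np.log(np.array([0.4, 0.3, 0.2, 0.1]))   # hypothetical item weights
keys = gumbel_keys(log_w, rng)
subset_hard = hard_topk(keys, k=2)
subset_soft = relaxed_topk(keys, k=2, temperature=0.5)
```

In this sketch the noise is sampled independently of the weights, so the soft vector is a deterministic, differentiable function of the parameters given the noise, which is what makes reparameterized gradients possible in an autodiff framework. Lower temperatures bring the soft vector closer to the hard k-hot indicator, at the cost of less informative gradients.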

Publication: arXiv e-prints
Pub Date: January 2019
DOI: 10.48550/arXiv.1901.10517
arXiv: arXiv:1901.10517
Bibcode: 2019arXiv190110517X
Keywords: Computer Science - Machine Learning; Statistics - Machine Learning
E-Print: IJCAI 2019
