Deep Variational Information Bottleneck

doi:10.48550/arXiv.1612.00410

Deep Variational Information Bottleneck

We present a variational approximation to the information bottleneck of Tishby et al. (1999). This variational approach allows us to parameterize the information bottleneck model using a neural network and leverage the reparameterization trick for efficient training. We call this method "Deep Variational Information Bottleneck", or Deep VIB. We show that models trained with the VIB objective outperform those that are trained with other forms of regularization, in terms of generalization performance and robustness to adversarial attack.

Publication:

arXiv e-prints

Pub Date:

December 2016

DOI:

10.48550/arXiv.1612.00410

arXiv:

arXiv:1612.00410

Bibcode:

2016arXiv161200410A

Keywords:

Computer Science - Machine Learning;
Computer Science - Information Theory

E-Print:

19 pages, 8 figures, Accepted to ICLR17

ADS

Deep Variational Information Bottleneck

Abstract