Almost Sure Asymptotic Freeness of Neural Network Jacobian with Orthogonal Weights

doi:10.48550/arXiv.1908.03901

Almost Sure Asymptotic Freeness of Neural Network Jacobian with Orthogonal Weights

Hayase, Tomohiro

A well-conditioned Jacobian spectrum has a vital role in preventing exploding or vanishing gradients and speeding up learning of deep neural networks. Free probability theory helps us to understand and handle the Jacobian spectrum. We rigorously show almost sure asymptotic freeness of layer-wise Jacobians of deep neural networks as the wide limit. In particular, we treat the case that weights are initialized as Haar distributed orthogonal matrices.

Publication:

arXiv e-prints

Pub Date:

August 2019

DOI:

10.48550/arXiv.1908.03901

arXiv:

arXiv:1908.03901

Bibcode:

2019arXiv190803901H

Keywords:

Mathematics - Probability;
Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

The proof of main theorem use the orthogonal invariance of joint distribution, which need further non-trivial discussion. Thus we withdraw this

NASA/ADS

Almost Sure Asymptotic Freeness of Neural Network Jacobian with Orthogonal Weights

Abstract