Riemannian metrics for neural networks I: feedforward networks

doi:10.48550/arXiv.1303.0818

Riemannian metrics for neural networks I: feedforward networks

Ollivier, Yann

We describe four algorithms for neural network training, each adapted to different scalability constraints. These algorithms are mathematically principled and invariant under a number of transformations in data and network representation, from which performance is thus independent. These algorithms are obtained from the setting of differential geometry, and are based on either the natural gradient using the Fisher information matrix, or on Hessian methods, scaled down in a specific way to allow for scalability while keeping some of their key mathematical properties.

Publication:

arXiv e-prints

Pub Date:

March 2013

DOI:

10.48550/arXiv.1303.0818

arXiv:

arXiv:1303.0818

Bibcode:

2013arXiv1303.0818O

Keywords:

Computer Science - Neural and Evolutionary Computing;
Computer Science - Information Theory;
Computer Science - Machine Learning;
Mathematics - Differential Geometry;
68T05

E-Print:

(5th version, minor changes)

NASA/ADS

Riemannian metrics for neural networks I: feedforward networks

Abstract