Les Houches lectures on deep learning at large and infinite width
Abstract
These lectures, presented at the 2022 Les Houches Summer School on Statistical Physics and Machine Learning, focus on the infinite-width limit and large-width regime of deep neural networks. Topics covered include the various statistical and dynamical properties of these networks. In particular, the lecturers discuss properties of random deep neural networks, connections between trained deep neural networks, linear models, kernels and Gaussian processes that arise in the infinite-width limit, and perturbative and non-perturbative treatments of large but finite-width networks, at initialization and after training.
- Publication:
-
Journal of Statistical Mechanics: Theory and Experiment
- Pub Date:
- October 2024
- DOI:
- arXiv:
- arXiv:2309.01592
- Bibcode:
- 2024JSMTE2024j4012B
- Keywords:
-
- deep learning;
- machine learning;
- stochastic processes;
- critical behavior of disordered systems;
- Statistics - Machine Learning;
- Computer Science - Artificial Intelligence;
- Computer Science - Machine Learning;
- High Energy Physics - Theory;
- Mathematics - Probability
- E-Print:
- These are notes from lectures delivered by Yasaman Bahri and Boris Hanin at the 2022 Les Houches Summer School on Statistics Physics and Machine Learning and a first version of them were transcribed by Antonin Brossollet, Vittorio Erba, Christian Keup, Rosalba Pacelli, James B. Simon