Functional Central Limit Theorem and Strong Law of Large Numbers for Stochastic Gradient Langevin Dynamics
Abstract
We study the mixing properties of an important optimization algorithm of machine learning: the stochastic gradient Langevin dynamics (SGLD) with a fixed step size. The data stream is not assumed to be independent hence the SGLD is not a Markov chain, merely a \emph{Markov chain in a random environment}, which complicates the mathematical treatment considerably. We derive a strong law of large numbers and a functional central limit theorem for SGLD.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2022
- DOI:
- arXiv:
- arXiv:2210.02092
- Bibcode:
- 2022arXiv221002092L
- Keywords:
-
- Mathematics - Probability;
- Computer Science - Machine Learning;
- Mathematics - Optimization and Control;
- 60J05;
- 60J20
- E-Print:
- 16 pages