A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization

doi:10.48550/arXiv.2102.10880

A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization

Machine learning practitioners invest significant manual and computational resources in finding suitable learning rates for optimization algorithms. We provide a probabilistic motivation, in terms of Gaussian inference, for popular stochastic first-order methods. As an important special case, it recovers the Polyak step with a general metric. The inference allows us to relate the learning rate to a dimensionless quantity that can be automatically adapted during training by a control algorithm. The resulting meta-algorithm is shown to adapt learning rates in a robust manner across a large range of initial values when applied to deep learning benchmark problems.

Publication:

arXiv e-prints

Pub Date:

February 2021

DOI:

10.48550/arXiv.2102.10880

arXiv:

arXiv:2102.10880

Bibcode:

2021arXiv210210880D

Keywords:

Computer Science - Machine Learning

NASA/ADS

A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization

Abstract