A Linearly-Convergent Stochastic L-BFGS Algorithm

doi:10.48550/arXiv.1508.02087

A Linearly-Convergent Stochastic L-BFGS Algorithm

We propose a new stochastic L-BFGS algorithm and prove a linear convergence rate for strongly convex and smooth functions. Our algorithm draws heavily from a recent stochastic variant of L-BFGS proposed in Byrd et al. (2014) as well as a recent approach to variance reduction for stochastic gradient descent from Johnson and Zhang (2013). We demonstrate experimentally that our algorithm performs well on large-scale convex and non-convex optimization problems, exhibiting linear convergence and rapidly solving the optimization problems to high levels of precision. Furthermore, we show that our algorithm performs well for a wide-range of step sizes, often differing by several orders of magnitude.

Publication:

arXiv e-prints

Pub Date:

August 2015

DOI:

10.48550/arXiv.1508.02087

arXiv:

arXiv:1508.02087

Bibcode:

2015arXiv150802087M

Keywords:

Mathematics - Optimization and Control;
Computer Science - Machine Learning;
Mathematics - Numerical Analysis;
Statistics - Computation;
Statistics - Machine Learning

E-Print:

10 pages, 3 figures in International Conference on Artificial Intelligence and Statistics, 2016

NASA/ADS

A Linearly-Convergent Stochastic L-BFGS Algorithm

Abstract