A Linearly-Convergent Stochastic L-BFGS Algorithm
Abstract
We propose a new stochastic L-BFGS algorithm and prove a linear convergence rate for strongly convex and smooth functions. Our algorithm draws heavily from a recent stochastic variant of L-BFGS proposed in Byrd et al. (2014) as well as a recent approach to variance reduction for stochastic gradient descent from Johnson and Zhang (2013). We demonstrate experimentally that our algorithm performs well on large-scale convex and non-convex optimization problems, exhibiting linear convergence and rapidly solving the optimization problems to high levels of precision. Furthermore, we show that our algorithm performs well for a wide-range of step sizes, often differing by several orders of magnitude.
- Publication:
-
arXiv e-prints
- Pub Date:
- August 2015
- DOI:
- 10.48550/arXiv.1508.02087
- arXiv:
- arXiv:1508.02087
- Bibcode:
- 2015arXiv150802087M
- Keywords:
-
- Mathematics - Optimization and Control;
- Computer Science - Machine Learning;
- Mathematics - Numerical Analysis;
- Statistics - Computation;
- Statistics - Machine Learning
- E-Print:
- 10 pages, 3 figures in International Conference on Artificial Intelligence and Statistics, 2016