Stochastic Nonconvex Optimization with Large Minibatches

We study stochastic optimization of nonconvex loss functions, which are typical objectives for training neural networks. We propose stochastic approximation algorithms which optimize a series of regularized, nonlinearized losses on large minibatches of samples, using only first-order gradient information. Our algorithms provably converge to an approximate critical point of the expected objective with faster rates than minibatch stochastic gradient descent, and facilitate better parallelization by allowing larger minibatches.
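The abstract gives no algorithmic details, so the following is only a minimal sketch of one way to read "a series of regularized, nonlinearized losses on large minibatches". It assumes each outer step draws a fresh large minibatch and approximately minimizes the exact (not linearized) minibatch loss plus a quadratic penalty that keeps the iterate near the previous one, using plain gradient steps. The toy loss, the function names, and every parameter value (`gamma`, `lr`, `batch_size`, the step counts) are hypothetical choices for illustration and are not taken from the paper.

```python
import random


def sample_grad(w, x):
    # Gradient in w of the toy nonconvex per-sample loss 0.25 * (w*w - x)**2.
    # This loss is an assumption made for illustration only.
    return (w * w - x) * w


def minibatch_prox_sketch(w0, outer_steps=30, batch_size=256,
                          inner_steps=20, gamma=1.0, lr=0.1, seed=0):
    """Hypothetical sketch: solve a sequence of regularized minibatch problems."""
    rng = random.Random(seed)
    w = w0
    for _ in range(outer_steps):
        # Draw one large minibatch; samples have mean 1.0, so the expected
        # objective has critical points at w = -1, 0, and 1.
        batch = [rng.gauss(1.0, 0.5) for _ in range(batch_size)]
        anchor = w  # previous iterate, center of the quadratic regularizer
        v = anchor
        for _ in range(inner_steps):
            # First-order information only: average minibatch gradient ...
            g = sum(sample_grad(v, x) for x in batch) / batch_size
            # ... plus the gradient of (gamma / 2) * (v - anchor)**2.
            g += gamma * (v - anchor)
            v -= lr * g
        w = v  # approximate minimizer of the regularized minibatch loss
    return w


if __name__ == "__main__":
    w = minibatch_prox_sketch(0.5)
    # Gradient of the expected toy objective at the returned point.
    print("w =", w, "expected gradient =", (w * w - 1.0) * w)
```

In this sketch the same minibatch is reused for all inner steps, which is where a larger batch could be spread across workers; whether that matches the paper's actual scheme should be checked against the full text.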

Publication:

arXiv e-prints

Pub Date:

September 2017

DOI:

10.48550/arXiv.1709.08728

arXiv:

arXiv:1709.08728

Bibcode:

2017arXiv170908728W

Keywords:

Computer Science - Machine Learning

E-Print:

Accepted at ALT 2019
