Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms

doi:10.48550/arXiv.1506.06438

Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms

Stochastic gradient descent (SGD) is a ubiquitous algorithm for a variety of machine learning problems. Researchers and industry have developed several techniques to optimize SGD's runtime performance, including asynchronous execution and reduced precision. Our main result is a martingale-based analysis that enables us to capture the rich noise models that may arise from such techniques. Specifically, we use our new analysis in three ways: (1) we derive convergence rates for the convex case (Hogwild!) with relaxed assumptions on the sparsity of the problem; (2) we analyze asynchronous SGD algorithms for non-convex matrix problems including matrix completion; and (3) we design and analyze an asynchronous SGD algorithm, called Buckwild!, that uses lower-precision arithmetic. We show experimentally that our algorithms run efficiently for a variety of problems on modern hardware.

Publication:

arXiv e-prints

Pub Date:

June 2015

DOI:

10.48550/arXiv.1506.06438

arXiv:

arXiv:1506.06438

Bibcode:

2015arXiv150606438D

Keywords:

Computer Science - Machine Learning;
Mathematics - Optimization and Control;
Statistics - Machine Learning

NASA/ADS

Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms

Abstract