Parallel Stochastic Asynchronous Coordinate Descent: Tight Bounds on the Possible Parallelism
Abstract
Several works have shown linear speedup is achieved by an asynchronous parallel implementation of stochastic coordinate descent so long as there is not too much parallelism. More specifically, it is known that if all updates are of similar duration, then linear speedup is possible with up to $\Theta(\sqrt n L_{\max}/L_{\overline{\mathrm{res}}})$ processors, where $L_{\max}$ and $L_{\overline{\mathrm{res}}}$ are suitable Lipschitz parameters. This paper shows the bound is tight for almost all possible values of these parameters.
- Publication:
-
arXiv e-prints
- Pub Date:
- November 2018
- DOI:
- 10.48550/arXiv.1811.05087
- arXiv:
- arXiv:1811.05087
- Bibcode:
- 2018arXiv181105087K
- Keywords:
-
- Mathematics - Optimization and Control;
- Computer Science - Distributed;
- Parallel;
- and Cluster Computing