A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

doi:10.48550/arXiv.1712.07495

A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

We consider the problem of learning a high-dimensional but low-rank matrix from a large-scale dataset distributed over several machines, where low-rankness is enforced by a convex trace norm constraint. We propose DFW-Trace, a distributed Frank-Wolfe algorithm which leverages the low-rank structure of its updates to achieve efficiency in time, memory and communication usage. The step at the heart of DFW-Trace is solved approximately using a distributed version of the power method. We provide a theoretical analysis of the convergence of DFW-Trace, showing that we can ensure sublinear convergence in expectation to an optimal solution with few power iterations per epoch. We implement DFW-Trace in the Apache Spark distributed programming framework and validate the usefulness of our approach on synthetic and real data, including the ImageNet dataset with high-dimensional features extracted from a deep neural network.

Publication:

arXiv e-prints

Pub Date:

December 2017

DOI:

10.48550/arXiv.1712.07495

arXiv:

arXiv:1712.07495

Bibcode:

2017arXiv171207495Z

Keywords:

Computer Science - Distributed;
Parallel;
and Cluster Computing;
Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

doi:10.1007/s10994-018-5713-5

NASA/ADS

A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

Abstract