Optimal Importance Sampling for Federated Learning

doi:10.48550/arXiv.2010.13600

Optimal Importance Sampling for Federated Learning

Federated learning involves a mixture of centralized and decentralized processing tasks, where a server regularly selects a sample of the agents and these in turn sample their local data to compute stochastic gradients for their learning updates. This process runs continually. The sampling of both agents and data is generally uniform; however, in this work we consider non-uniform sampling. We derive optimal importance sampling strategies for both agent and data selection and show that non-uniform sampling without replacement improves the performance of the original FedAvg algorithm. We run experiments on a regression and classification problem to illustrate the theoretical results.

Publication:

arXiv e-prints

Pub Date:

October 2020

DOI:

10.48550/arXiv.2010.13600

arXiv:

arXiv:2010.13600

Bibcode:

2020arXiv201013600R

Keywords:

Computer Science - Machine Learning;
Computer Science - Distributed;
Parallel;
and Cluster Computing

NASA/ADS

Optimal Importance Sampling for Federated Learning

Abstract