Federated Dropout: Convergence Analysis and Resource Allocation

doi:10.48550/arXiv.2501.00379

Federated Dropout is an efficient technique for overcoming both the communication and computation bottlenecks of deploying federated learning at the network edge. In each training round, an edge device updates and transmits only a sub-model, generated by the standard dropout method from deep learning, which reduces the per-round latency. However, a theoretical convergence analysis of Federated Dropout is still lacking in the literature, particularly regarding the quantitative influence of the dropout rate on convergence. To address this issue, we use a Taylor expansion to show that the gradient variance increases by a scaling factor of $\gamma/(1-\gamma)$, where $\gamma \in [0, \theta)$ denotes the dropout rate and $\theta$ is the maximum dropout rate that still ensures a reduction of the loss function. Based on this approximation, we provide a convergence analysis for Federated Dropout. Specifically, we show that a larger dropout rate at each device leads to a slower convergence rate. This provides a theoretical foundation for reducing the convergence latency by trading off the per-round latency against the total number of rounds until convergence. Moreover, we propose a low-complexity algorithm that jointly optimizes the dropout rate and the bandwidth allocation to minimize the loss function over all rounds under a given per-round latency and limited network resources. Finally, numerical results verify the effectiveness of the proposed algorithm.
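
The factor $\gamma/(1-\gamma)$ can be illustrated in the simplest possible setting. The sketch below is not the paper's Taylor-expansion derivation. It assumes plain inverted dropout applied to one fixed coordinate: the coordinate is kept with probability $1-\gamma$ and rescaled by $1/(1-\gamma)$, otherwise it is zeroed. Under that assumption the masked value is unbiased and its variance equals the squared value times $\gamma/(1-\gamma)$. The function names `dropout_variance_factor` and `empirical_variance_factor` are hypothetical and chosen only for this example.

```python
import random


def dropout_variance_factor(gamma: float) -> float:
    """Closed-form variance multiplier gamma / (1 - gamma) for inverted dropout."""
    if not 0.0 <= gamma < 1.0:
        raise ValueError("gamma must lie in [0, 1)")
    return gamma / (1.0 - gamma)


def empirical_variance_factor(gamma: float, value: float = 1.0,
                              trials: int = 200_000, seed: int = 0) -> float:
    """Monte Carlo estimate of Var[m * value / (1 - gamma)] / value**2,
    where m ~ Bernoulli(1 - gamma) is the keep mask (assumed setting)."""
    rng = random.Random(seed)
    keep_prob = 1.0 - gamma
    # Each sample is the rescaled value if kept, or zero if dropped.
    samples = [(value / keep_prob) if rng.random() < keep_prob else 0.0
               for _ in range(trials)]
    mean = sum(samples) / trials
    var = sum((s - mean) ** 2 for s in samples) / trials
    return var / value ** 2


if __name__ == "__main__":
    for gamma in (0.0, 0.1, 0.3, 0.5):
        print(gamma,
              dropout_variance_factor(gamma),
              round(empirical_variance_factor(gamma), 3))
```

The closed-form factor is zero at $\gamma = 0$ and grows without bound as $\gamma \to 1$, which is consistent with the abstract's statement that a larger dropout rate slows convergence.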

Publication: arXiv e-prints
Pub Date: December 2024
DOI: 10.48550/arXiv.2501.00379
arXiv: arXiv:2501.00379
Bibcode: 2025arXiv250100379X
Keywords: Computer Science - Machine Learning; Computer Science - Information Theory
