4.7 Article

Optimal User-Edge Assignment in Hierarchical Federated Learning Based on Statistical Properties and Network Topology Constraints

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TNSE.2021.3053588

Keywords

Data models; Computational modeling; Servers; Optimization; Synchronization; Distributed databases; Performance evaluation; Edge computing; federated learning; hierarchical federated learning; imbalanced data; user-edge assignment

Funding

  1. NPRP grant from the Qatar National Research Fund [NPRP12S-0305190231]

Ask authors/readers for more resources

Distributed learning algorithms aim to utilize diverse data stored at users' devices to learn a global phenomena. However, when the data is strongly skewed, the performance of the global model can decrease. To tackle this issue, the paper proposes a hierarchical learning system that optimizes user-edge assignment to improve model accuracy.
Distributed learning algorithms aim to leverage distributed and diverse data stored at users' devices to learn a global phenomena by performing training amongst participating devices and periodically aggregating their local models' parameters into a global model. Federated learning is a promising paradigm that allows for extending local training among the participant devices before aggregating the parameters, offering better communication efficiency. However, in the cases where the participants' data are strongly skewed (i.e., non-IID), the local models can overfit local data, leading to low performing global model. In this paper, we first show that a major cause of the performance drop is the weighted distance between the distribution over classes on users' devices and the global distribution. Then, to face this challenge, we leverage the edge computing paradigm to design a hierarchical learning system that performs Federated Gradient Descent on the user-edge layer and Federated Averaging on the edge-cloud layer. In this hierarchical architecture, we formalize and optimize this user-edge assignment problem such that edge-level data distributions turn to be similar (i.e., close to IID), which enhances the Federated Averaging performance. Our experiments on multiple real-world datasets show that the proposed optimized assignment is tractable and leads to faster convergence of models towards a better accuracy value.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available