4.5 Article

Multi-Stage Hybrid Federated Learning Over Large-Scale D2D-Enabled Fog Networks

期刊

IEEE-ACM TRANSACTIONS ON NETWORKING
卷 30, 期 4, 页码 1569-1584

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNET.2022.3143495

关键词

Collaborative work; Device-to-device communication; Training; Servers; Topology; Computational modeling; Convergence; Fog learning; device-to-device communications; peer-to-peer learning; cooperative learning; distributed machine learning; semi-decentralized federated learning

资金

  1. ONR [N00014-21-1-2472]
  2. NSC [W15QKN-15-9-1004]
  3. NSF [CNS-1642982, CNS-2129015, EEC1941529, CNS-1824518]

向作者/读者索取更多资源

This paper proposes a multi-stage hybrid federated learning (MH-FL) method, extending the traditional federated learning topology through the network dimension and considering a multi-layer cluster-based structure. The research results demonstrate the advantages of MH-FL in terms of resource utilization metrics.
Federated learning has generated significant interest, with nearly all works focused on a ``star'' topology where nodes/devices are each connected to a central server. We migrate away from this architecture and extend it through the network dimension to the case where there are multiple layers of nodes between the end devices and the server. Specifically, we develop multi-stage hybrid federated learning (MH-FL), a hybrid of intra- and inter-layer model learning that considers the network as a multi-layer cluster-based structure. MH-FL considers the topology structures among the nodes in the clusters, including local networks formed via device-to-device (D2D) communications, and presumes a semi-decentralized architecture for federated learning. It orchestrates the devices at different network layers in a collaborative/cooperative manner (i.e., using D2D interactions) to form local consensus on the model parameters and combines it with multi-stage parameter relaying between layers of the tree-shaped hierarchy. We derive the upper bound of convergence for MH-FL with respect to parameters of the network topology (e.g., the spectral radius) and the learning algorithm (e.g., the number of D2D rounds in different clusters). We obtain a set of policies for the D2D rounds at different clusters to guarantee either a finite optimality gap or convergence to the global optimum. We then develop a distributed control algorithm for MH-FL to tune the D2D rounds in each cluster over time to meet specific convergence criteria. Our experiments on real-world datasets verify our analytical results and demonstrate the advantages of MH-FL in terms of resource utilization metrics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据