4.8 Article

Lazily Aggregated Quantized Gradient Innovation for Communication-Efficient Federated Learning

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2020.3033286

关键词

Quantization (signal); Servers; Technological innovation; Convergence; Frequency modulation; Distributed databases; Collaborative work; Federated learning; communication-efficient; gradient innovation; quantization

资金

  1. NSFC [61873118]
  2. Shenzhen Committee on Science and Innovations [GJHZ20180411143603361]
  3. Department of Science and Technology of Guangdong Province [2018A050506003]
  4. China Scholarship Council
  5. Key-Area Research and Development Program of Guangdong Province [2018B010107002]
  6. National Natural Science Foundation of China [61751205]
  7. NSF [1901134]

向作者/读者索取更多资源

This paper proposes an adaptive communication method for the federated learning problem, which saves communication costs by quantizing gradients and skipping less informative communications. Extensive experiments validate the effectiveness of this method.
This paper focuses on communication-efficient federated learning problem, and develops a novel distributed quantized gradient approach, which is characterized by adaptive communications of the quantized gradients. Specifically, the federated learning builds upon the server-worker infrastructure, where the workers calculate local gradients and upload them to the server; then the server obtain the global gradient by aggregating all the local gradients and utilizes it to update the model parameter. The key idea to save communications from the worker to the server is to quantize gradients as well as skip less informative quantized gradient communications by reusing previous gradients. Quantizing and skipping result in 'lazy' worker-server communications, which justifies the term Lazily Aggregated Quantized (LAQ) gradient. Theoretically, the LAQ algorithm achieves the same linear convergence as the gradient descent in the strongly convex case, while effecting major savings in the communication in terms of transmitted bits and communication rounds. Empirically, extensive experiments using realistic data corroborate a significant communication reduction compared with state-of-the-art gradient- and stochastic gradient-based algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据