4.8 Article

Federated Deep Reinforcement Learning for Internet of Things With Decentralized Cooperative Edge Caching

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 7, Issue 10, Pages 9441-9455

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2020.2986803

Keywords

Internet of Things; Training; Delays; Machine learning; Simulation; Electronic mail; Wireless communication; Cooperative caching; deep reinforcement learning (DRL); edge caching; federated learning; hit rate; Internet of Things (IoT)

Funding

  1. National Key Research and Development Program of China [2019YFB2101901, 2018YFC0809803, 2018YFF0214700, 2018YFB2100100]
  2. China NSFC [61702364, 61902044]
  3. Chongqing Research Program of Basic Research and Frontier Technology [cstc2019jcyj-msxmX0589]
  4. Chinese National Engineering Laboratory for Big Data System Computing Technology
  5. Canadian NSERC
  6. European Unions [871780]
  7. Academy of Finland CSN Project [311654]
  8. Academy of Finland 6Genesis Project [318927]

Ask authors/readers for more resources

Edge caching is an emerging technology for addressing massive content access in mobile networks to support rapidly growing Internet-of-Things (IoT) services and applications. However, most current optimization-based methods lack a self-adaptive ability in dynamic environments. To tackle these challenges, current learning-based approaches are generally proposed in a centralized way. However, network resources may be overconsumed during the training and data transmission process. To address the complex and dynamic control issues, we propose a federated deep-reinforcement-learning-based cooperative edge caching (FADE) framework. FADE enables base stations (BSs) to cooperatively learn a shared predictive model by considering the first-round training parameters of the BSs as the initial input of the local training, and then uploads near-optimal local parameters to the BSs to participate in the next round of global training. Furthermore, we prove the expectation convergence of FADE. Trace-driven simulation results demonstrate the effectiveness of the proposed FADE framework on reducing the performance loss and average delay, offloading backhaul traffic, and improving the hit rate.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available