4.8 Article

Exploring Deep-Reinforcement-Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 9, Issue 21, Pages 21099-21110

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2022.3176739

Keywords

Internet of Things; Training; Computational modeling; Resource management; Servers; Energy consumption; Data models; Deep reinforcement learning (DRL); federated learning (FL); Internet of Things; mobile-edge computing (MEC); online resource allocation

Funding

  1. National Funds through FCT/MCTES (Portuguese Foundation for Science and Technology), within the CISTER Research Unit [UIDP/UIDB/04234/2020]
  2. National Funds through FCT [PTDC/EEICOM/3362/2021]

Ask authors/readers for more resources

This article investigates the problem of protecting data training privacy in the Internet of Things. By selecting IoT devices with different data set sizes, the balance between learning accuracy and energy consumption can be achieved. The authors propose a new framework to balance the accuracy and energy consumption of FL, and leverage LSTM to predict network state.
Federated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile-edge computing-based Internet of Things (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large data sets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small data sets for FL, resulting in a falling learning accuracy. In this article, we formulate a new resource allocation problem for privacy-preserving EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new FL-enabled twin-delayed deep deterministic policy gradient (FL-DLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short-term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to the existing state-of-the-art benchmark.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available