4.8 Article

Multiagent DDPG-Based Deep Learning for Smart Ocean Federated Learning IoT Networks

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 7, Issue 10, Pages 9895-9903

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2020.2988033

Keywords

Training; Wireless communication; Computational modeling; Resource management; Data models; Oceans; Adaptation models; Deep reinforcement learning; federated learning (FL); smart ocean networks

Funding

  1. National Research Foundation of Korea [2019M3E4A1080391]
  2. Institute for Information and Communications Technology Promotion grant - Korea Government (MSIT, A Development of Driving Decision Engine for Autonomous Driving Using Driving Experience Information) [2017-0-00068]
  3. Institute for Information & Communication Technology Planning & Evaluation (IITP), Republic of Korea [2017-0-00068-004] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

This article proposes a novel multiagent deep reinforcement learning-based algorithm which can realize federated learning (FL) computation with Internet-of-Underwater-Things (IoUT) devices in the ocean environment. According to the fact that underwater networks are relatively not easy to set up reliable links by huge fading compared to wireless free-space air medium, gathering all training data for conducting centralized deep learning training is not easy. Therefore, FL-based distributed deep learning can be a suitable solution for this application. In this IoUT network (IoUT-Net) scenario, the FL system needs to construct a global learning model by aggregating the local model parameters that are obtained from individual IoUT devices. In order to reliably deliver the parameters from IoUT devices to a centralized FL machine, base station like devices are needed. Therefore, a joint cell association and resource allocation (JCARA) method is required and it is designed inspired by multiagent deep deterministic policy gradient (MADDPG) to deal with distributed situations and unexpected time-varying states. The performance evaluation results show that our proposed MADDPG-based algorithm achieves 80% and 41% performance improvements than the standard actor-critic and DDPG, respectively, in terms of the downlink throughput.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available