4.8 Article

Soft Actor-Critic-Based Computation Offloading in Multiuser MEC-Enabled IoT-A Lifetime Maximization Perspective

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 10, Issue 20, Pages 17571-17584

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2023.3277753

Keywords

Deep reinforcement learning (DRL); Internet of Things (IoT); lifetime maximization; mobile-edge computing (MEC); soft actor-critic (SAC)

Ask authors/readers for more resources

This article studies the problem of network lifetime optimization in a multiuser mobile-edge computing system and proposes a deep reinforcement learning method. The experimental results demonstrate that this method has a fast convergence rate and a small oscillation amplitude, and can significantly improve the network lifetime compared to other schemes.
This article studies the network lifetime optimization problem in a multiuser mobile-edge computing (MEC)-enabled Internet of Things (IoT) system comprising an access point (AP), a MEC server, and a set of K mobile devices (MDs) with limited battery capacity. Considering the residual battery energy at the MDs, stochastic task arrivals, and time-varying wireless fading channels, a soft actor-critic (SAC)-based deep reinforcement learning (DRL) lifetime maximization, called DeepLM, is proposed to jointly optimize the task splitting ratio, the local CPU-cycle frequencies at the MDs, the bandwidth allocation, and the CPU-cycle frequency allocation at the MEC server subject to the task queuing backlogs constraint, the bandwidth constraint, and maximum CPU-cycle frequency constraints at the MDs and the MEC server. Our results reveal that DeepLM enjoys a fast convergence rate and a small oscillation amplitude. We also compare the performance of DeepLM with three benchmark offloading schemes, namely, fully edge computing (FEC), fully local computing (FLC), and random computation offloading (RCO). DeepLM increases the network lifetime by 496% and 229% compared to the FLC and RCO schemes. Interestingly, it achieves such a colossal lifetime improvement when its nonbacklog probability is 0.99, while that of FEC, FLC, and RCO is 0.69, 0.53, and 0.25, respectively, showing a significant performance gain of 30%, 46%, and 74%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available