4.8 Article

Soft Actor-Critic-Based Computation Offloading in Multiuser MEC-Enabled IoT-A Lifetime Maximization Perspective

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 10, 期 20, 页码 17571-17584

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2023.3277753

关键词

Deep reinforcement learning (DRL); Internet of Things (IoT); lifetime maximization; mobile-edge computing (MEC); soft actor-critic (SAC)

向作者/读者索取更多资源

This article studies the problem of network lifetime optimization in a multiuser mobile-edge computing system and proposes a deep reinforcement learning method. The experimental results demonstrate that this method has a fast convergence rate and a small oscillation amplitude, and can significantly improve the network lifetime compared to other schemes.
This article studies the network lifetime optimization problem in a multiuser mobile-edge computing (MEC)-enabled Internet of Things (IoT) system comprising an access point (AP), a MEC server, and a set of K mobile devices (MDs) with limited battery capacity. Considering the residual battery energy at the MDs, stochastic task arrivals, and time-varying wireless fading channels, a soft actor-critic (SAC)-based deep reinforcement learning (DRL) lifetime maximization, called DeepLM, is proposed to jointly optimize the task splitting ratio, the local CPU-cycle frequencies at the MDs, the bandwidth allocation, and the CPU-cycle frequency allocation at the MEC server subject to the task queuing backlogs constraint, the bandwidth constraint, and maximum CPU-cycle frequency constraints at the MDs and the MEC server. Our results reveal that DeepLM enjoys a fast convergence rate and a small oscillation amplitude. We also compare the performance of DeepLM with three benchmark offloading schemes, namely, fully edge computing (FEC), fully local computing (FLC), and random computation offloading (RCO). DeepLM increases the network lifetime by 496% and 229% compared to the FLC and RCO schemes. Interestingly, it achieves such a colossal lifetime improvement when its nonbacklog probability is 0.99, while that of FEC, FLC, and RCO is 0.69, 0.53, and 0.25, respectively, showing a significant performance gain of 30%, 46%, and 74%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据