4.6 Article

Multi-Agent Deep Reinforcement Learning-Based Partial Task Offloading and Resource Allocation in Edge Computing Environment

期刊

ELECTRONICS
卷 11, 期 15, 页码 -

出版社

MDPI
DOI: 10.3390/electronics11152394

关键词

mobile edge computing; low latency; deep reinforcement learning; partial offloading; resource allocation

资金

  1. Jilin Province Scientific and Technological Planning Project of China [20210101415JC, YDZJ202201ZYTS556]
  2. Jilin Province Education Department Scientific Research Planning Foundation of China [JJKH20210753KJ]

向作者/读者索取更多资源

This paper introduces a mobile edge computing (MEC) framework in 5G wireless networks, which utilizes MEC servers to handle the computation tasks of wireless nodes, meeting the requirements of low latency and high reliability. The authors propose a deep reinforcement learning-based algorithm for task offloading and resource allocation, which achieves optimal decision-making policy through training neural networks. Simulation results show that this algorithm has good convergence and outperforms other baseline algorithms.
In the dense data communication environment of 5G wireless networks, with the dramatic increase in the amount of request computation tasks generated by intelligent wireless mobile nodes, its computation ability cannot meet the requirements of low latency and high reliability. Mobile edge computing (MEC) can utilize its servers with mighty computation power and closer to tackle the computation tasks offloaded by the wireless node (WN). The physical location of the MEC server is closer to WN, thereby meeting the requirements of low latency and high reliability. In this paper, we implement an MEC framework with multiple WNs and multiple MEC servers, which consider the randomness and divisibility of arrival request tasks from WN, the time-varying channel state between WN and MEC server, and different priorities of tasks. In the proposed MEC system, we present a decentralized multi-agent deep reinforcement learning-based partial task offloading and resource allocation algorithm (DeMADRL) to minimize the long-term weighted cost including delay cost and bandwidth cost. DeMADRL is a model-free scheme based on Double Deep Q-Learning (DDQN) and can obtain the optimal computation offloading and bandwidth allocation decision-making policy by training the neural networks. The comprehensive simulation results show that the proposed DeMADRL optimization scheme has a nice convergence and outperforms the other three baseline algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据