4.7 Article

RMDDQN-Learning: Computation Offloading Algorithm Based on Dynamic Adaptive Multi-Objective Reinforcement Learning in Internet of Vehicles

期刊

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
卷 72, 期 9, 页码 11374-11388

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2023.3270967

关键词

Computation offloading; Internet of Vehicles; mobile edge computing; multi-objective reinforcement learning; radial basis neural networks

向作者/读者索取更多资源

Mobile edge computing (MEC) enables smart vehicles in the Internet of Vehicles (IoV) to offload computation-intensive tasks to edge devices, but faces challenges in minimizing delay and energy consumption while ensuring data privacy security and resource load balancing. We propose a novel multi-objective reinforcement learning (MORL) algorithm based on double deep Q-network (DDQN) to optimize computation offloading, and show through numerical experiments that it outperforms traditional reinforcement learning methods by reducing overall energy consumption by 30.24%.
As a promising computing paradigm driven by 5G, mobile edge computing (MEC) empowers smart vehicles to offload computation-intensive tasks to edge devices in the Internet of Vehicles (IoV), thereby providing a plethora of exciting applications (e.g. on-board AR/VR, autonomous driving, etc.) with the unique quality of service (QoS) guarantees. However, a key challenge of MEC is howto keep the delay and energy consumption to aminimum in the computation offloading process, while ensuring the privacy security of offloaded data and the load balancing of edge resources. Namely, howto simultaneously optimize multiple indicators that affect computation offloading, which creates a challenging multi-objective optimization (MOO) problem. Aiming at the MOO problem, we propose a novel multi-objective reinforcement learning (MORL) algorithm based on double deep Q-network (DDQN). Each DDQN agent obtains rewards on different objectives according to different reward functions, and dynamically approximates the optimal offloading decision on multiple objectives. To conquer the trade-off problem among multiple conflicting objectives, we propose a weight-learning network based on radial basis function (RBF) networks, which dynamically adjusts the weights by learning the value changes among the objectives. Interestingly, we discovered encouraging potential ofMORLfor solving computation offloading problems in IoV, and numerical results show that the proposed algorithm outperforms traditional reinforcement learning methods by 30.24% in terms of overall energy consumption.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据