4.8 Article

Joint Trajectory Planning, Application Placement, and Energy Renewal for UAV-Assisted MEC: A Triple-Learner-Based Approach

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 10, 期 15, 页码 13622-13636

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2023.3262687

关键词

Long-term optimization; mobile-edge computing (MEC); reinforcement learning; stochastic game; unmanned aerial vehicle (UAV)

向作者/读者索取更多资源

This article studies an energy-efficient scheduling problem for multiple UAV-assisted MEC. It formulates a joint optimization problem of UAVs' trajectory planning, energy renewal, and application placement. It proposes a TLRL approach to reach equilibriums in the optimization problem and analyzes the convergence and complexity of the proposed solution. Simulations demonstrate the superiority of the TLRL approach over counterparts.
In this article, an energy-efficient scheduling problem for multiple unmanned aerial vehicle (UAV)-assisted mobile-edge computing (MEC) is studied. In the considered model, UAVs act as mobile edge servers to provide computing services to end-users with task offloading requests. Unlike existing works, we allow UAVs to determine not only their trajectories but also the decisions of whether returning to the depot for replenishing energies and updating application placements (due to their limited batteries and storage capacities). With the aim of maximizing the long-term energy efficiency of all UAVs, i.e., the total amount of offloaded tasks computed by all UAVs over their total energy consumption, a joint optimization of UAVs' trajectory planning, energy renewal, and application placement is formulated. Taking into account the underlying cooperation and competition among intelligent UAVs, we reformulate such optimization problem as three coupled multiagent stochastic games. Since the prior environment information is unavailable to UAVs, we propose a novel triple-learner-based reinforcement learning (TLRL) approach, integrating a trajectory learner, an energy learner, and an application learner, for reaching equilibriums. Moreover, we analyze the convergence and the complexity of the proposed solution. Simulations are conducted to evaluate the performance of the proposed TLRL approach, and demonstrate its superiority over counterparts.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据