4.7 Article

Distributed UAV-BSs Trajectory Optimization for User-Level Fair Communication Service With Multi-Agent Deep Reinforcement Learning

期刊

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
卷 70, 期 12, 页码 12290-12301

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2021.3117792

关键词

Vehicle dynamics; Trajectory; Wireless communication; Throughput; Base stations; Reinforcement learning; Unmanned aerial vehicles; UAV communication; deep reinforcement learning; UAV control; fairness

资金

  1. National Key Research and Development Plan [2017YFC0821003-2]
  2. Dalian Science and Technology Innovation Fund [2019J11CY004, 2020JJ26GX037]
  3. project of Shenzhen Science and Technology Innovation Committee [JCYJ20190809145407809]
  4. Open fund of State Key Laboratory of Acoustics [SKLA202102]
  5. National Natural Science Foundation of China [62072072]

向作者/读者索取更多资源

The study proposed a distributed UAV-BS control approach based on multi-agent deep reinforcement learning, which can improve the fairness of communication service by sacrificing a small amount of throughput. By designing the trajectory of UAV-BSs, it addressed the fairness issue at user-level and achieved weighted-throughput maximization.
Unmanned Aerial Vehicles (UAVs) have attacted much attention in the field of wireless communication due to its agility and altitude. UAVs can be used as low-altitude aerial base stations (UAV-BSs) to provide communication services for ground devices (GDs) in various scenarios, such as emergency communication and traffic offloading in hotspots. However, due to the limited communication ranges and high prices of commercial UAV-BSs, covering a target area all the time with sufficient UAVs is quite challenging, especially under dynamic environment. We need to design the trajectory of the UAV-BSs to optimize system performance. Most existing works focus on the energy-efficient coverage and throughput maximization but ignore the fairness of communication service, especially the fairness at user-level. Besides, reinforcement learning is suitable for solving decision problems in dynamic environments. However, most existing works use centralized deep reinforcement learning (DRL) approaches. Due to the scalability and low time complexity, a distributed DRL approach is more suitable for multiple UAV-BSs communication system in dynamic environment. Unlike previous works, we characterize the fairness at user-level based on proportional fairness scheduling and formulate a weighted-throughput maximization problem via designing UAV-BSs' trajectory. Then we model the dynamic deploymentproblem of UAV-BSs as a Markov game and propose a multi-agent deep reinforcement learning-based distributed UAV-BSs control approach named MAUC. MAUC approach adopts the framework of centralized training with distributed execution. Simulation results show that the MAUC can improve fairness of communication service by sacrificing a small amount of throughput.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据