4.6 Article

Energy-Efficient UAV Movement Control for Fair Communication Coverage: A Deep Reinforcement Learning Approach

期刊

SENSORS
卷 22, 期 5, 页码 -

出版社

MDPI
DOI: 10.3390/s22051919

关键词

UAV; fairness; coverage score; reinforcement learning; actor-critic

向作者/读者索取更多资源

This paper introduces a novel distributed control solution called SBG-AC, which focuses on improving coverage score with minimum energy consumption and high fairness value by placing a group of UAVs in a candidate area. The state-based potential game is used to model the complex interactions, and the actor-critic algorithm is merged to ensure convergence and distributed control of each UAV with learning capabilities in dynamic environments. Simulation results show that SBG-AC outperforms other algorithms in terms of fairness, coverage score, and energy consumption.
Unmanned Aerial Vehicles (UAVs) are considered an important element in wireless communication networks due to their agility, mobility, and ability to be deployed as mobile base stations (BSs) in the network to improve the communication quality and coverage area. UAVs can be used to provide communication services for ground users in different scenarios, such as transportation systems, disaster situations, emergency cases, and surveillance. However, covering a specific area under a dynamic environment for a long time using UAV technology is quite challenging due to its limited energy resources, short communication range, and flying regulations and rules. Hence, a distributed solution is needed to overcome these limitations and to handle the interactions among UAVs, which leads to a large state space. In this paper, we introduced a novel distributed control solution to place a group of UAVs in the candidate area in order to improve the coverage score with minimum energy consumption and a high fairness value. The new algorithm is called the state-based game with actor-critic (SBG-AC). To simplify the complex interactions in the problem, we model SBG-AC using a state-based potential game. Then, we merge SBG-AC with an actor-critic algorithm to assure the convergence of the model, to control each UAV in a distributed way, and to have learning capabilities in case of dynamic environments. Simulation results show that the SBG-AC outperforms the distributed DRL and the DRL-EC3 in terms of fairness, coverage score, and energy consumption.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据