Article

Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS (2020)

Journal

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS

Volume 19, Issue 2, Pages 729-743

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TWC.2019.2935201

Keywords

Resource management; Trajectory; Wireless communication; Communication networks; Dynamic scheduling; Stochastic processes; Reinforcement learning; Dynamic resource allocation; multi-agent reinforcement learning (MARL); stochastic games; UAV communications

Categories

Engineering, Electrical & Electronic; Telecommunications

Funding

U.K. Engineering and Physical Sciences Research Council (EPSRC) [EP/N029720/2]
EPSRC [EP/N029720/2] Funding Source: UKRI

Abstract

Unmanned aerial vehicles (UAVs) can serve as aerial base stations (BSs) that provide cost-effective, on-demand wireless communications. This article investigates dynamic resource allocation in multi-UAV-enabled communication networks with the goal of maximizing long-term rewards. More specifically, each UAV communicates with a ground user by autonomously selecting its communicating user, power level, and subchannel without any information exchange among UAVs. To model the dynamics and uncertainty of the environment, we formulate the long-term resource allocation problem as a stochastic game for maximizing the expected rewards, where each UAV is a learning agent and each resource allocation solution corresponds to an action taken by the UAVs. We then develop a multi-agent reinforcement learning (MARL) framework in which each agent learns its best strategy from its local observations. In particular, we propose an agent-independent method in which all agents run a decision algorithm independently but share a common structure based on Q-learning. Finally, simulation results reveal that: 1) appropriately chosen exploitation and exploration parameters enhance the performance of the proposed MARL-based resource allocation algorithm; 2) the proposed MARL algorithm provides acceptable performance compared to the case with complete information exchange among UAVs. It thereby strikes a good tradeoff between performance gains and information exchange overheads.
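
The agent-independent idea in the abstract can be illustrated with a minimal sketch of independent tabular Q-learning, in which each UAV keeps its own Q-table over joint (user, power level, subchannel) actions and learns from its own reward only. This is not the paper's algorithm or system model: the class name IndependentQAgent, the toy reward (1 if no other UAV picked the same subchannel, else 0), the state definition (whether the agent's last transmission succeeded), and all parameter values are assumptions made for illustration.

```python
import itertools
import random


class IndependentQAgent:
    """One UAV running tabular Q-learning over (user, power, subchannel) actions.

    Hypothetical illustration; hyperparameter defaults are arbitrary assumptions.
    """

    def __init__(self, n_users, n_powers, n_channels,
                 alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        # Joint action space: every combination of user, power level, subchannel.
        self.actions = list(itertools.product(
            range(n_users), range(n_powers), range(n_channels)))
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = {}  # (state, action) -> value estimate; missing entries are 0.0
        self.rng = random.Random(seed)

    def value(self, state, action):
        return self.q.get((state, action), 0.0)

    def act(self, state):
        # Epsilon-greedy: explore with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.value(state, a))

    def update(self, state, action, reward, next_state):
        # Standard Q-learning update using only this agent's own experience.
        best_next = max(self.value(next_state, a) for a in self.actions)
        old = self.value(state, action)
        target = reward + self.gamma * best_next
        self.q[(state, action)] = old + self.alpha * (target - old)


def toy_rewards(actions):
    """Assumed toy reward: 1.0 if the UAV's subchannel is used by no other UAV."""
    channels = [a[2] for a in actions]
    return [1.0 if channels.count(c) == 1 else 0.0 for c in channels]


def train(n_agents=2, n_steps=2000, seed=0):
    """Run all agents in parallel; no Q-values or actions are exchanged."""
    agents = [IndependentQAgent(2, 2, 2, seed=seed + i) for i in range(n_agents)]
    states = [0] * n_agents  # assumed local state: did my last transmission succeed?
    for _ in range(n_steps):
        actions = [ag.act(s) for ag, s in zip(agents, states)]
        rewards = toy_rewards(actions)
        next_states = [int(r > 0) for r in rewards]
        for ag, s, a, r, s2 in zip(agents, states, actions, rewards, next_states):
            ag.update(s, a, r, s2)
        states = next_states
    return agents
```

Calling train() returns the list of trained agents, whose q dictionaries can be inspected to see which subchannel each UAV came to prefer. The epsilon parameter controls the exploration and exploitation balance that the abstract identifies as affecting performance, and coupling between agents enters only through the shared reward function, not through any message passing.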
