Journal
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
Volume 69, Issue 2, Pages 1828-1840
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2019.2961405
Keywords
Device-to-device (D2D) communications; multi-agent deep reinforcement learning; spectrum allocation
Funding
- National Natural Science Foundation of China [61571062, 61871047]
- Natural Science Foundation of Beijing Municipality [4202049]
- National Key R&D Program of China [2018YFB1800805]
Device-to-device (D2D) communication underlaying cellular networks is a promising technique to improve spectrum efficiency. However, D2D transmission may cause severe interference to both the cellular and other D2D links, which poses a great technical challenge to spectrum allocation. Existing centralized schemes require global information, which incurs a large signaling overhead, while existing distributed schemes require frequent information exchange among D2D users and cannot achieve global optimization. In this paper, a distributed spectrum allocation framework based on multi-agent deep reinforcement learning, named multi-agent actor-critic (MAAC), is proposed. MAAC shares global historical states, actions, and policies during centralized training, requires no signaling interaction during execution, and exploits cooperation among users to further optimize system performance. Moreover, to reduce the computational complexity of training, we further propose the neighbor-agent actor-critic (NAAC), which uses only neighboring users' historical information for centralized training. Simulation results show that the proposed MAAC and NAAC effectively reduce the outage probability of cellular links, greatly improve the sum rate of D2D links, and converge quickly.
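The centralized-training, decentralized-execution idea behind MAAC/NAAC can be illustrated with a minimal toy sketch: during training, a shared critic (here just a global reward baseline) sees the joint outcome of all agents, while each agent's actor uses only its own parameters; at execution time no information is exchanged. Everything below (agent/channel counts, learning rates, the collision-based reward proxy, tabular policies instead of deep networks) is an illustrative assumption, not the paper's actual algorithm.

```python
import numpy as np

rng = np.random.default_rng(0)
N_AGENTS, N_CHANNELS = 2, 3          # hypothetical sizes, not from the paper
logits = np.zeros((N_AGENTS, N_CHANNELS))  # each row: one agent's local actor
baseline = 0.0                       # stand-in for the centralized critic

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def reward(actions):
    # Toy sum-rate proxy: +1 per agent on an uncontested channel,
    # 0 for agents that collide (mutual interference).
    return sum(1.0 for i, a in enumerate(actions)
               if all(actions[j] != a for j in range(N_AGENTS) if j != i))

for step in range(3000):
    probs = [softmax(logits[i]) for i in range(N_AGENTS)]
    actions = [int(rng.choice(N_CHANNELS, p=p)) for p in probs]
    r = reward(actions)              # global reward: visible only in training
    advantage = r - baseline
    baseline += 0.05 * (r - baseline)
    for i in range(N_AGENTS):        # REINFORCE update of each local actor
        grad = -probs[i]
        grad[actions[i]] += 1.0
        logits[i] += 0.1 * advantage * grad

# Execution is fully decentralized: each agent acts from its own policy alone.
final = [int(np.argmax(logits[i])) for i in range(N_AGENTS)]
print(final)
```

In this anti-coordination toy, the shared baseline lets each agent's update account for the system-wide outcome, so the agents learn to occupy distinct channels without any message exchange at execution time; NAAC's refinement of restricting the critic to neighbor information would correspond to computing the baseline from a subset of agents.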