4.7 Article

Intelligent Radio Access Network Slicing for Service Provisioning in 6G: A Hierarchical Deep Reinforcement Learning Approach

Journal

IEEE TRANSACTIONS ON COMMUNICATIONS
Volume 69, Issue 9, Pages 6063-6078

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCOMM.2021.3090423

Keywords

Quality of service; Network slicing; Stochastic processes; Resource management; Optimization; Mathematical model; Dynamic scheduling; Network slicing; 5G beyond; 6G; deep reinforcement learning; and radio resource management

Funding

  1. Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Program [RGPIN2018-06254]
  2. Canada Research Chair Program

Ask authors/readers for more resources

This paper proposes an intelligent RAN slicing strategy with two-layered control granularity to maximize the long-term QoS of services and spectrum efficiency of slices. The proposed method consists of an upper-level controller for ensuring QoS performance and a lower-level controller for improving SE of slices. By utilizing a model-free deep reinforcement learning framework, the effectiveness of the proposed intelligent RAN slicing scheme is confirmed through simulation results.
Network slicing is a key paradigm in 5G and is expected to be inherited in future 6G networks for the concurrent provisioning of diverse quality of service (QoS). Unfortunately, effective slicing of Radio Access Networks (RAN) is still challenging due to time-varying network situations. This paper proposes a new intelligent RAN slicing strategy with two-layered control granularity, which aims at maximizing both the long-term QoS of services and spectrum efficiency (SE) of slices. The proposed method consists of an upper-level controller to ensure the QoS performance, which enforces loose control by performing adaptive slice configuration according to the long-term dynamics of service traffic. The lower-level controller is to improve SE of slices, by tightly scheduling radio resources to users at the small time-scale. To realize the proposed RAN slicing strategy, we propose a model-free deep reinforcement learning (DRL) framework, which is a hierarchical structure that collaboratively integrating the modified deep deterministic policy gradient (DDPG) and double deep-Q-network algorithm. Specifically, the lower-level control problem is a mixed-integer stochastic optimization problem with multiple constraints. This kind of problem is hard to be directly solved by the exiting DRL algorithms, since it involves searching for the solution in a vast set of mixed-integer action space, which will induce unbearable computational complexity. Thus, we propose a novel action space reducing approach, embedding the convex optimization tools into the DDPG algorithm, to speed up the lower-level control. Furthermore, simulation results confirm the effectiveness of our proposed intelligent RAN slicing scheme.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available