Journal
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS
Volume 37, Issue 10, Pages 2377-2392
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSAC.2019.2933893
Keywords
Network slicing; radio access networks; mobile-edge computing; packet scheduling; Markov decision process; deep reinforcement learning
Funding
- Academy of Finland [319759, 319758, 289611]
- National Key R&D Program of China [2017YFB1301003]
- National Natural Science Foundation of China [61701439, 61731002]
- Zhejiang Key Research and Development Plan [2019C01002]
- Japan Society for the Promotion of Science (JSPS) KAKENHI [18KK0279]
- Telecommunications Advanced Foundation
Abstract
As cellular networks become increasingly agile, a major challenge lies in how to support diverse services for mobile users (MUs) over a common physical network infrastructure. Network slicing is a promising solution that tailors the network to match such service requests. This paper considers a system with radio access network (RAN)-only slicing, where the physical infrastructure is split into slices providing computation and communication functionalities. A limited number of channels are auctioned across scheduling slots to the MUs of multiple service providers (SPs), i.e., the tenants. Each SP behaves selfishly to maximize the expected long-term payoff from its competition with the other SPs for the orchestration of channels, which provides its MUs with opportunities to access the computation and communication slices. This problem is modelled as a stochastic game, in which the decision-making of an SP depends on the global network dynamics as well as the joint control policy of all SPs. To approximate the Nash equilibrium solutions, we first construct an abstract stochastic game with local conjectures of the channel auction among the SPs. We then linearly decompose the per-SP Markov decision process to simplify the decision-making at each SP and derive an online scheme based on deep reinforcement learning to approach the optimal abstract control policies. Numerical experiments show significant performance gains from our scheme.
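The per-SP decision process described in the abstract can be illustrated with a toy sketch. This is not the paper's algorithm: tabular Q-learning stands in for the deep reinforcement learning scheme, each SP's own queue length stands in for the abstracted local state, and two SPs bid for a single channel per scheduling slot. All class names, parameters, and the reward shape here are hypothetical assumptions for illustration only.

```python
import random

random.seed(0)

class SPAgent:
    """Tabular Q-learning agent for one service provider (SP).

    Hypothetical stand-in for the paper's deep-RL scheme: the state is
    the SP's local queue length, the action is a discrete bid in the
    per-slot channel auction.
    """
    def __init__(self, max_queue, n_bids, alpha=0.1, gamma=0.9, eps=0.2):
        self.q = [[0.0] * n_bids for _ in range(max_queue + 1)]
        self.alpha, self.gamma, self.eps = alpha, gamma, eps
        self.n_bids = n_bids

    def act(self, state):
        # Epsilon-greedy bid selection.
        if random.random() < self.eps:
            return random.randrange(self.n_bids)
        row = self.q[state]
        return max(range(self.n_bids), key=row.__getitem__)

    def learn(self, s, a, r, s2):
        # Standard one-step Q-learning update.
        target = r + self.gamma * max(self.q[s2])
        self.q[s][a] += self.alpha * (target - self.q[s][a])

MAX_Q, N_BIDS, SLOTS = 5, 4, 5000
agents = [SPAgent(MAX_Q, N_BIDS) for _ in range(2)]
queues = [0, 0]

for _ in range(SLOTS):
    states = list(queues)
    bids = [ag.act(s) for ag, s in zip(agents, states)]
    # Highest bid wins the single channel; ties broken at random.
    winner = max(range(2), key=lambda i: (bids[i], random.random()))
    for i in range(2):
        served = 1 if (i == winner and queues[i] > 0) else 0
        arrival = 1 if random.random() < 0.4 else 0  # Bernoulli packet arrivals
        queues[i] = min(MAX_Q, queues[i] - served + arrival)
        payment = bids[i] if i == winner else 0
        # Illustrative reward: queueing (delay) cost plus auction payment.
        reward = -queues[i] - 0.5 * payment
        agents[i].learn(states[i], bids[i], reward, queues[i])
```

Because each agent observes only its own queue and treats the rival's bids as part of the environment, this mirrors, in a crude way, the "local conjecture" abstraction the abstract describes, under which each SP solves its own MDP.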