期刊
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT
卷 17, 期 2, 页码 946-957出版社
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNSM.2020.2964003
关键词
Security; Repeaters; Resource management; Heuristic algorithms; Optical fiber networks; Network architecture; Data communication; Quantum key distribution networks; online multi-tenant provisioning; heuristics; reinforcement learning
资金
- National Natural Science Foundation of China [61601052, 61822105]
- Fundamental Research Funds for the Central Universities [2019XD-A05]
- State Key Laboratory of Information Photonics and Optical Communications of China [IPOC2019ZR01]
- China Association for Science and Technology
- Swedish Research Council
- Swedish Foundation for Strategic Research
- Swedish Foundation for International Collaboration in Research and Higher Education
- Goran Gustafsson Foundation
Quantum key distribution (QKD) networks are potential to be widely deployed in the immediate future to provide long-term security for data communications. Given the high price and complexity, multi-tenancy has become a cost-effective pattern for QKD network operations. In this work, we concentrate on addressing the online multi-tenant provisioning (On-MTP) problem for QKD networks, where multiple tenant requests (TRs) arrive dynamically. On-MTP involves scheduling multiple TRs and assigning non-reusable secret keys derived from a QKD network to multiple TRs, where each TR can be regarded as a high-security-demand organization with the dedicated secret-key demand. The quantum key pools (QKPs) are constructed over QKD network infrastructure to improve management efficiency for secret keys. We model the secret-key resources for QKPs and the secret-key demands of TRs using distinct images. To realize efficient On-MTP, we perform a comparative study of heuristics and reinforcement learning (RL) based On-MTP solutions, where three heuristics (i.e., random, fit, and best-fit based On-MTP algorithms) are presented and a RL framework is introduced to realize automatic training of an On-MTP algorithm. The comparative results indicate that with sufficient training iterations the RL-based On-MTP algorithm significantly outperforms the presented heuristics in terms of tenant-request blocking probability and secret-key resource utilization.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据