4.5 Article

Multi-Tenant Provisioning for Quantum Key Distribution Networks With Heuristics and Reinforcement Learning: A Comparative Study

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNSM.2020.2964003

Keywords

Security; Repeaters; Resource management; Heuristic algorithms; Optical fiber networks; Network architecture; Data communication; Quantum key distribution networks; online multi-tenant provisioning; heuristics; reinforcement learning

Funding

  1. National Natural Science Foundation of China [61601052, 61822105]
  2. Fundamental Research Funds for the Central Universities [2019XD-A05]
  3. State Key Laboratory of Information Photonics and Optical Communications of China [IPOC2019ZR01]
  4. China Association for Science and Technology
  5. Swedish Research Council
  6. Swedish Foundation for Strategic Research
  7. Swedish Foundation for International Collaboration in Research and Higher Education
  8. Goran Gustafsson Foundation

Ask authors/readers for more resources

Quantum key distribution (QKD) networks are potential to be widely deployed in the immediate future to provide long-term security for data communications. Given the high price and complexity, multi-tenancy has become a cost-effective pattern for QKD network operations. In this work, we concentrate on addressing the online multi-tenant provisioning (On-MTP) problem for QKD networks, where multiple tenant requests (TRs) arrive dynamically. On-MTP involves scheduling multiple TRs and assigning non-reusable secret keys derived from a QKD network to multiple TRs, where each TR can be regarded as a high-security-demand organization with the dedicated secret-key demand. The quantum key pools (QKPs) are constructed over QKD network infrastructure to improve management efficiency for secret keys. We model the secret-key resources for QKPs and the secret-key demands of TRs using distinct images. To realize efficient On-MTP, we perform a comparative study of heuristics and reinforcement learning (RL) based On-MTP solutions, where three heuristics (i.e., random, fit, and best-fit based On-MTP algorithms) are presented and a RL framework is introduced to realize automatic training of an On-MTP algorithm. The comparative results indicate that with sufficient training iterations the RL-based On-MTP algorithm significantly outperforms the presented heuristics in terms of tenant-request blocking probability and secret-key resource utilization.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available