4.5 Article

Adaptive Protection of Scientific Backbone Networks Using Machine Learning

Journal

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT
Volume 18, Issue 1, Pages 1064-1076

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNSM.2021.3050964

Keywords

Bandwidth; Government; Quality of service; Adaptive systems; Service level agreements; Predictive models; Licenses; Quality of Service (QoS); availability; traffic prediction; energy sciences network; service level agreement (SLA); deep learning

Funding

  1. High Speed Networks Laboratory (HSNLab)
  2. National Research, Development and Innovation Fund of Hungary [FK_17, KH_18, K_17, K_18, TKP2020, 123957, 129589, 124171, 134604, 128062]
  3. TKP2020, Institutional Excellence Program of the National Research Development and Innovation Office in the field of Artificial Intelligence (BME IE-MI-SC TKP2020)
  4. NSF [2018754]
  5. COST (European Cooperation in Science and Technology) [CA15127]
  6. U.S. Department of Energy [DE-AC02-05CH11231]
  7. Office of Advanced Cyberinfrastructure (OAC)
  8. Direct For Computer & Info Scie & Enginr [2018754] Funding Source: National Science Foundation

Ask authors/readers for more resources

This article proposes a new protection scheme for backbone networks that utilizes Machine Learning to implement network intelligence and achieve a proactive approach to service availability without the need for reserved backup network resources. By reallocating unused capacity as protection bandwidth, the scheme aims to improve availability without affecting operational connections or the over-provisioning of network bandwidth.
In this article, we propose a new protection scheme for backbone networks to guarantee high service availability. The presented scheme does not require any reconfiguration immediately after the failure (i.e., it is proactive). At the same time, it does not require any reserved backup network resources either. To achieve these seemingly contradictory goals, we utilize the recent advancements in Machine Learning (ML) to implement a network intelligence that periodically re-allocates the unused capacity as protection bandwidth to meet the service availability requirements of each connection. Our goal is achieved by two components (1) predicting the traffic for the next period on each link, and (2) intelligently selecting the best fit dedicated protection scheme for the next period depending on the estimated unused (spare) bandwidth and the previous service availability violations. Note that re-allocating protection bandwidth affects neither the operational connections nor the current best practice of operators to over-provision network bandwidth to support elephant flows. Finally, we provide a case study on the real traffic from Energy Sciences Network (ESnet), a high-speed, international scientific backbone network. The key benefit of our framework is that adaptively utilizing the over-provisioned bandwidth for spare capacity is sufficient to improve the availability from three-nines to five-nines (in ESnet for the 30 examined connections). The drawback is negligible bandwidth limitations; the user perceives a minor and very temporal bandwidth limitation in less than 0.1% of the time.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available