3.8 Proceedings Paper

A Security-Constrained Reinforcement Learning Framework for Software Defined Networks

出版社

IEEE
DOI: 10.1109/ICC42927.2021.9500763

关键词

Security and Safety; Deep Reinforcement Learning; Software Defined Networks

资金

  1. U.S. Army Research Laboratory
  2. U.K. Ministry of Defence [W911NF-16-3-0001]
  3. U.S. Army Research Office [W911NF1910432]
  4. U.S. Department of Defense (DOD) [W911NF1910432] Funding Source: U.S. Department of Defense (DOD)

向作者/读者索取更多资源

Reinforcement Learning is an effective technique for building 'smart' SDN controllers, but may pose security risks during exploration. The Jarvis-SDN framework proposed in this paper focuses on learning intelligent policies to maximize functionality while ensuring security.
Reinforcement Learning (RL) is an effective technique for building 'smart' SDN controllers because of its model-free nature and ability to learn policies online without requiring extensive training data. However, as RL agents are geared to maximize functionality and explore the environment without constraints, security can be breached. In this paper, we propose Jarvis-SDN, a RL framework that constrains explorations by taking security into account. In Jarvis-SDN, the RL agent learns 'intelligent policies' which maximize functionality but not at the cost of security. Standard network flow based attack signatures obtained from intrusion detection system (IDS) datasets cannot be used as policies because they do not conform to the state model of the RL framework and thus have poor accuracy and high false positives. To address such issue, the security policies for constraining explorations in Jarvis-SDN are learnt in a semi-supervised manner in the form of 'partial attack signatures' from packet captures of IDS datasets that are then encoded in the objective function of the RL based optimization framework. These signatures are learnt using Deep Q-Networks (DQN). Our analysis shows that DQN based attack signatures perform better than classical machine learning techniques, like decision trees, random forests and deep neural networks (DNN), for common network attacks. We instantiate our framework for a SDN controller with the goal of intelligent rate control to further analyze the effectiveness of the attack signatures.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据