期刊
出版社
IEEE
DOI: 10.1109/TrustCom/BigDataSE.2018.00108
关键词
flow data reduction; suspicious flow detection; hierarchical clustering; cluster sampling scheme
资金
- Key Laboratory of Network Assessment Technology, Chinese Academy of Sciences
- Beijing Key Laboratory of Network Security and Protection Technology [2015AA017202, 61702508]
Attacks like APT have lasted for a long time which need suspicious flow detection on long-time data. However, the challenge of effectively analyzing massive data source for suspicious flow diagnosis is unmet yet. Consequently, flow data reduction should be adopted, which refers to abstract the most relevant information from the massive dataset. Existing approaches to sampling flow data are inherently inaccurate unless running at high sampling rate. In this paper, we proposed HCBS (Hierarchical Clustering Based Sampling), a flow data reduction scheme, to alleviate such problems. We study the characteristics of flow data relating malicious activities and employ hierarchical clustering to sample data for further deep detection. Experiments on 1999 DARPA dataset demonstrates that HCBS reduces the size of the flow data by 40% with only a small loss in accuracy and significantly outperforms the compared state-of-the-art.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据