3.8 Proceedings Paper

Hierarchical Clustering Based Network Traffic Data Reduction for Improving Suspicious Flow Detection

Publisher

IEEE
DOI: 10.1109/TrustCom/BigDataSE.2018.00108

Keywords

flow data reduction; suspicious flow detection; hierarchical clustering; cluster sampling scheme

Funding

  1. Key Laboratory of Network Assessment Technology, Chinese Academy of Sciences
  2. Beijing Key Laboratory of Network Security and Protection Technology [2015AA017202, 61702508]

Ask authors/readers for more resources

Attacks like APT have lasted for a long time which need suspicious flow detection on long-time data. However, the challenge of effectively analyzing massive data source for suspicious flow diagnosis is unmet yet. Consequently, flow data reduction should be adopted, which refers to abstract the most relevant information from the massive dataset. Existing approaches to sampling flow data are inherently inaccurate unless running at high sampling rate. In this paper, we proposed HCBS (Hierarchical Clustering Based Sampling), a flow data reduction scheme, to alleviate such problems. We study the characteristics of flow data relating malicious activities and employ hierarchical clustering to sample data for further deep detection. Experiments on 1999 DARPA dataset demonstrates that HCBS reduces the size of the flow data by 40% with only a small loss in accuracy and significantly outperforms the compared state-of-the-art.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available