4.6 Article

Intrusion detection based on improved density peak clustering for imbalanced data on sensor-cloud systems

期刊

JOURNAL OF SYSTEMS ARCHITECTURE
卷 118, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.sysarc.2021.102212

关键词

Density peak clustering; Network intrusion detection systems; Intrusion detection evaluation; Sensor-cloud systems; Network security

资金

  1. National Natural Science Foundation of China [61673186, 61972010, 61975124, 62006029]
  2. open project of Provincial Key Laboratory for Computer Information Processing Technology, Soochow University [KJS1839]
  3. Quanzhou City Science AMP
  4. Technology Program of China [2018C114R]
  5. Natural Science Foundation of Chongqing (China) [cstc2019jcyj-msxmX0683]
  6. Chongqing Municipal Education Commission (China) [KJQN202001434]
  7. Open Project of Beijing Key Laboratory of Big Data Technology for Food Safety [BTBD-2019KF06]

向作者/读者索取更多资源

Intrusion detection is crucial for cloud computing security, and using the improved DPeak algorithm can better handle data imbalance and anomalies, resulting in higher accuracy.
Intrusion detection has been extremely important for the security of the cloud computing environment for these years. However, it is also extremely hard to prevent network systems from attacking, due to that the attacking data and normal data usually have large different density distributions, i.e, they are imbalanced. Clustering is one of the effective methods for intrusion detection. Density Peak (DPeak) is a famous clustering algorithm that maps data of arbitrary dimension onto two dimensions, and it can automatically distinguish density centers and noise. However, it is not appropriate for applying DPeak in detecting intrusion data directly. Because (1) sparse regions are difficult to be identified and (2) most points in dense regions of imbalanced data are highly possible misclassified as outliers. Hence, an improved DPeak, namely Rotation-DPeak, is proposed to overcome them according to a simple assumption: the higher density of a point p, the larger delta it should have such that p can be picked as a density peak. Then, a novel strategy is invented to select density peaks by quadratic curve, rather than by choosing points with the largest gamma (gamma = rho x delta) or by drawing a rectangle on the decision graph. In addition, it is found that abnormal data usually leads to bad performance for intrusion detection, therefore we propose an outlier detection algorithm to identify anomaly traffic. Experiments prove that the proposed algorithm works well on imbalanced datasets, and is suitable for intrusion detection, which has a good performance in accuracy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据