4.6 Article

Intrusion detection based on improved density peak clustering for imbalanced data on sensor-cloud systems

Journal

JOURNAL OF SYSTEMS ARCHITECTURE
Volume 118, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.sysarc.2021.102212

Keywords

Density peak clustering; Network intrusion detection systems; Intrusion detection evaluation; Sensor-cloud systems; Network security

Funding

  1. National Natural Science Foundation of China [61673186, 61972010, 61975124, 62006029]
  2. open project of Provincial Key Laboratory for Computer Information Processing Technology, Soochow University [KJS1839]
  3. Quanzhou City Science AMP
  4. Technology Program of China [2018C114R]
  5. Natural Science Foundation of Chongqing (China) [cstc2019jcyj-msxmX0683]
  6. Chongqing Municipal Education Commission (China) [KJQN202001434]
  7. Open Project of Beijing Key Laboratory of Big Data Technology for Food Safety [BTBD-2019KF06]

Ask authors/readers for more resources

Intrusion detection is crucial for cloud computing security, and using the improved DPeak algorithm can better handle data imbalance and anomalies, resulting in higher accuracy.
Intrusion detection has been extremely important for the security of the cloud computing environment for these years. However, it is also extremely hard to prevent network systems from attacking, due to that the attacking data and normal data usually have large different density distributions, i.e, they are imbalanced. Clustering is one of the effective methods for intrusion detection. Density Peak (DPeak) is a famous clustering algorithm that maps data of arbitrary dimension onto two dimensions, and it can automatically distinguish density centers and noise. However, it is not appropriate for applying DPeak in detecting intrusion data directly. Because (1) sparse regions are difficult to be identified and (2) most points in dense regions of imbalanced data are highly possible misclassified as outliers. Hence, an improved DPeak, namely Rotation-DPeak, is proposed to overcome them according to a simple assumption: the higher density of a point p, the larger delta it should have such that p can be picked as a density peak. Then, a novel strategy is invented to select density peaks by quadratic curve, rather than by choosing points with the largest gamma (gamma = rho x delta) or by drawing a rectangle on the decision graph. In addition, it is found that abnormal data usually leads to bad performance for intrusion detection, therefore we propose an outlier detection algorithm to identify anomaly traffic. Experiments prove that the proposed algorithm works well on imbalanced datasets, and is suitable for intrusion detection, which has a good performance in accuracy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available