3.8 Proceedings Paper

A High Performance Intrusion Detection System Using LightGBM Based on Oversampling and Undersampling

Journal

Publisher

SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-031-13870-6_53

Keywords

Intrusion detection systems; Resampling; LightGBM; Autoencoder

Funding

  1. National Natural Science Foundation of China [61862007]
  2. Guangxi Natural Science Foundation [2020GXNSFBA297103]

Ask authors/readers for more resources

This paper proposes the use of ADASYN and random under sampling techniques to address data imbalance in intrusion detection systems, achieving balanced datasets through under and over sampling, feature extraction with SAE, and classification with LightGBM. Experimental results demonstrate the superiority of this approach over existing methods.
Intrusion detection system plays an important role in network security, however, the problem with data imbalance limits the detection ability of intrusion detection system. In order to improve the performance of intrusion detection system, this paper proposes to use the adaptive synthetic sampling technique (ADASYN) and random under sampling technique to alleviate the problem of data imbalance in intrusion detection. Firstly, the majority class samples in the dataset are removed by undersampling technology and the minority class samples are oversampled, so the samples can reach a balanced state. Subsequently, a sparse autoencoder (SAE) extracts features from the resampled data to fit the original sample as closely as possible. Finally, LightGBM is applied on the processed dataset for the classification process. Multi-classification experiments were conducted on KDD99 and UNSWNB15 datasets. We compare six models' performance and find LightGBM is superior to other models. Furthermore, we also compare existing methods and the results show that our proposed method outperforms current methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available