4.6 Article

An Agile Approach to Identify Single and Hybrid Normalization for Enhancing Machine Learning-Based Network Intrusion Detection

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 137494-137513

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3118361

Keywords

Intrusion detection; Mathematical models; Feature extraction; Training; Standards; Statistical analysis; Numerical models; Anomaly detection; Bot-IoT; CIC-IDS 2017; intrusion detection; IoT; ISCX-IDS 2012; normalization; NSL KDD; skewness; scaling; transformation; UNSW-NB15

Funding

  1. National Research Foundation of Korea (NRF) - Korean Government (MSIT) [NRF-2019R1F1A1062320]
  2. Ministry of Science and Information Communication Technology (ICT) (MSIT), South Korea [IITP-2021-2016-0-00313]

Ask authors/readers for more resources

This paper discusses the importance of intrusion detection in improving network security, the application in the field of machine learning, and the selection of suitable normalization methods for datasets.
Detecting intrusion in network traffic has remained a problematic task for years. Progress in the field of machine learning is paving the way for enhancing intrusion detection systems. Due to this progress intrusion detection has become an integral part of network security. Intrusion detection has achieved high detection accuracy with the help of supervised machine learning methods. A key factor in enhancing the performance of supervised classifiers is how data is augmented for training the classification model. Data in real-world networks or publicly available datasets are not always normally (Gaussian) distributed. Instead, the distributions of variables are more likely to be skewed. To achieve a high detection rate, data normalization or transformation plays an important role for machine learning-based intrusion detection systems. Several methods are available to normalize the attributes of the data before training a classification model. However, opting for the most suitable normalization technique is still a questionable task. In this paper, a statistical method is proposed that can identify the most suitable normalization method for the dataset. The normalization method identified by the proposed approach gives the highest accuracy for an intrusion detection system. To highlight the efficiency of the proposed method, five different datasets were used with two different feature selection methods. The datasets belong to both Internet of things and traditional network environments. The proposed method is also able to identify hybrid normalizations to achieve even improved intrusion detection results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available