4.4 Article

Intrusion detection system over real-time data traffic using machine learning methods with feature selection approaches

Journal

Publisher

SPRINGER
DOI: 10.1007/s10207-022-00616-4

Keywords

CICIDS2017; IDS; FSA; NSL-KDD; Classifiers

Ask authors/readers for more resources

The intrusion detection system (IDS) is crucial for extracting and analyzing network traffic to detect abnormal activity. However, emerging technologies generate large volumes of traffic that may contain irrelevant attributes. To address this issue, researchers have used feature selection approaches to remove non-relevant features and find important ones, and have investigated various classifiers to improve IDS performance.
The intrusion detection system (IDS) plays an important role in extracting and analysing the network traffics to detect aberrant activity. However, emerging technologies, like cloud computing, Internet of Things, etc., generate a large volume of traffics, which may carry the irrelevant attributes that do not have any impact on classification or in detection of assaults. Hence, it's became an open challenge for the researchers to extract the meaningful data from huge amounts of traffic and also to examine whether the selected features could increase IDS performance or not. To solve these issues, features selection approaches (FSA) have been used in this research to remove non-relevant features and find the important ones. Later, the various classifiers have been used to investigate the best classifier which could increase the performance of IDS's detection-engine on the NSL-KDD datasets. However, to validate, the investigated best-performing classifier with the suitable features selection technique (FST) has also been implemented on a real-time dataset, i.e. combined CICIDS2017. The experiment results in this research suggest that the acquired subset of relevant features under the proposed model's (Decision Tree + Recursive Feature Elimination) could increase the IDS performance with average accuracy of 99.21% and 99.94% on the well-known NSL-KDD and CICIDS2017 datasets, respectively, and could also minimize the computation cost, in parallel.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available