☆ 4.7 Article

Feature selection by multi-objective optimisation: Application to network anomaly detection by hierarchical self-organising maps

KNOWLEDGE-BASED SYSTEMS (2014)

Journal

KNOWLEDGE-BASED SYSTEMS

Volume 71, Issue -, Pages 322-338

Publisher

ELSEVIER

DOI: 10.1016/j.knosys.2014.08.013

Keywords

Feature selection; Multi-objective optimisation; Unsupervised clustering; Growing self-organising maps; Network anomaly detection; IDS

Funding

FEDER funds
Ministerio de Ciencia e Innovacion of the Spanish Government [TIN2012-32039]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Feature selection is an important and active issue in clustering and classification problems. By choosing an adequate feature subset, a dataset dimensionality reduction is allowed, thus contributing to decreasing the classification computational complexity, and to improving the classifier performance by avoiding redundant or irrelevant features. Although feature selection can be formally defined as an optimisation problem with only one objective, that is, the classification accuracy obtained by using the selected feature subset, in recent years, some multi-objective approaches to this problem have been proposed. These either select features that not only improve the classification accuracy, but also the generalisation capability in case of supervised classifiers, or counterbalance the bias toward lower or higher numbers of features that present some methods used to validate the clustering/classification in case of unsupervised classifiers. The main contribution of this paper is a multi-objective approach for feature selection and its application to an unsupervised clustering procedure based on Growing Hierarchical Self-Organising Maps (GHSOMs) that includes a new method for unit labelling and efficient determination of the winning unit. In the network anomaly detection problem here considered, this multi-objective approach makes it possible not only to differentiate between normal and anomalous traffic but also among different anomalies. The efficiency of our proposals has been evaluated by using the well-known DARPA/NSL-KDD datasets that contain extracted features and labelled attacks from around 2 million connections. The selected feature sets computed in our experiments provide detection rates up to 99.8% with normal traffic and up to 99.6% with anomalous traffic, as well as accuracy values up to 99.12%. (C) 2014 Elsevier B.V. All rights reserved.

Feature selection by multi-objective optimisation: Application to network anomaly detection by hierarchical self-organising maps

Journal

KNOWLEDGE-BASED SYSTEMS

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Feature selection by multi-objective optimisation: Application to network anomaly detection by hierarchical self-organising maps

Journal

KNOWLEDGE-BASED SYSTEMS

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper