4.7 Article

Fuzziness based semi-supervised learning approach for intrusion detection system

Journal

INFORMATION SCIENCES
Volume 378, Issue -, Pages 484-497

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2016.04.019

Keywords

Fuzziness; Divide-and-conquer strategy; Semi-supervised learning; Intrusion detection; Random weight neural network

Funding

  1. Deanship of Scientific Research at King Saud University [RG-1435-048]
  2. China Postdoctoral Science Foundation [2015M572361]
  3. Basic Research Project of Knowledge Innovation Program in Shenzhen [JCYJ20150324140036825]
  4. National Natural Science Foundations of China [61503252, 71371063]

Ask authors/readers for more resources

Countering cyber threats, especially attack detection, is a challenging area of research in the field of information assurance. Intruders use polymorphic mechanisms to masquerade the attack payload and evade the detection techniques Many supervised and unsupervised learning approaches from the field of machine learning and pattern recognition have been used to increase the efficacy of intrusion detection systems (IDSs). Supervised learning approaches use only labeled samples to train a classifier, but obtaining sufficient labeled samples is cumbersome, and requires the efforts of domain experts. However, unlabeled samples can easily be obtained in many real world problems. Compared to supervised learning approaches, semi-supervised learning (SSL) addresses this issue by considering large amount of unlabeled samples together with the labeled samples to build a better classifier. This paper proposes a novel fuzziness based semi-supervised learning approach by utilizing unlabeled samples assisted with supervised learning algorithm to improve the classifier's performance for the IDSs. A single hidden layer feed-forward neural network (SLFN) is trained to output a fuzzy membership vector, and the sample categorization (low, mid, and high fuzziness categories) on unlabeled samples is performed using the fuzzy quantity. The classifier is retrained after incorporating each category separately into the original training set. The experimental results using this technique of intrusion detection on the NSL-KDD dataset show that unlabeled samples belonging to low and high fuzziness groups make major contributions to improve the classifier's performance compared to existing classifiers e.g., naive bayes, support vector machine, random forests, etc. (C) 2016 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available