4.5 Article

Email Filtering Using Hybrid Feature Selection Model

Journal

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES
Volume 132, Issue 2, Pages 435-450

Publisher

TECH SCIENCE PRESS
DOI: 10.32604/cmes.2022.020088

Keywords

Feature selection; artificial bee colony; ant colony optimization; particle swarm optimization; spam detection; emails filtering

Ask authors/readers for more resources

This research proposes two models for spam detection and feature selection. The first model evaluates the dataset by reducing the number of keywords, and the results are promising. The second model creates features for spam detection and reduces the number of features using three metaheuristic algorithms, with highly significant outcomes.
Undoubtedly, spam is a serious problem, and the number of spam emails is increased rapidly. Besides, the massive number of spam emails prompts the need for spam detection techniques. Several methods and algorithms are used for spam filtering. Also, some emergent spam detection techniques use machine learning methods and feature extraction. Some methods and algorithms have been introduced for spam detecting and filtering. This research proposes two models for spam detection and feature selection. The first model is evaluated with the email spam classification dataset, which is based on reducing the number of keywords to its minimum. The results of this model are promising and highly acceptable. The second proposed model is based on creating features for spam detection as a first stage. Then, the number of features is reduced using three well-known metaheuristic algorithms at the second stage. The algorithms used in the second model are Artificial Bee Colony (ABC), Ant Colony Optimization (ACO), and Particle Swarm Optimization (PSO), and these three algorithms are adapted to fit the proposed model. Also, the authors give it the names AABC, AACO, and APSO, respectively. The dataset used for the evaluation of this model is Enron. Finally, well-known criteria are used for the evaluation purposes of this model, such as true positive, false positive, false negative, precision, recall, and F-Measure. The outcomes of the second proposed model are highly significant compared to the first one.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available