4.1 Article

Image spam filtering using convolutional neural networks

Journal

PERSONAL AND UBIQUITOUS COMPUTING
Volume 22, Issue 5-6, Pages 1029-1037

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s00779-018-1168-8

Keywords

Data augmentation; Convolutional neural networks; Image recognition; Image spam filtering

Funding

  1. National Youth Science Foundation project of China [F020101]
  2. Henan Province Science and Technology key Project [1521022101936]
  3. Natural Science Foundation of Hunan Province, China [2018JJ2023]
  4. Key projects of Science and Technology Research in Henan Education Department [15A520091, 17B520031]

Ask authors/readers for more resources

Spammers often embed text into images in order to avoid filtering by text-based spam filters, which result in a large number of advertisement spam images. Garbage image recognition has become one of the hotspots in the field of Internet spam filtering research. Its goal is to solve the problem that traditional spam information filtering methods encounter a sharp performance decline or even failure when filtering spam image information. Based on the clustering algorithm, this paper proposes a method to expand the data samples, which greatly improves the number of high-quality training samples and meets the needs of model training. Then, we train a convolutional neural networks using the enlarged data samples to recognize the SPAM in real time. The experimental results show that the accuracy of the model is increased by more than 14% after using the method of data augmentation. The accuracy of the model can be improved by 6% compared with other methods of data augmentation. Combined with convolutional neural networks and the proposed method of data augmentation, the accuracy of our SPAM filtering model is 7-11% higher than that of the traditional method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available