4.5 Article

Label flipping attacks against Naive Bayes on spam filtering systems

期刊

APPLIED INTELLIGENCE
卷 51, 期 7, 页码 4503-4514

出版社

SPRINGER
DOI: 10.1007/s10489-020-02086-4

关键词

Spam classification; Label flipping attacks; Naive Bayes classifier; Performance evaluation

资金

  1. Integration of Cloud Computing and Big Integration of Cloud Computing and Big Data, Innovation of Science and Education [2017A11017]
  2. Key Research, Development, and Dissemination Program of Henan Province (Science and Technology for the People) [182207310002]
  3. Key Science and Technology Project of Xinjiang Production and Construction Corps [2018AB017]

向作者/读者索取更多资源

Label flipping attack is a poisoning attack that reduces the classification performance of a model by flipping the labels of training samples. Naive Bayes algorithm demonstrates good robustness in handling issues like document classification and spam filtering. The proposed label flipping attacks effectively reduce the accuracy of various classification models.
Label flipping attack is a poisoning attack that flips the labels of training samples to reduce the classification performance of the model. Robustness is used to measure the applicability of machine learning algorithms to adversarial attack. Naive Bayes (NB) algorithm is a anti-noise and robust machine learning technique. It shows good robustness when dealing with issues such as document classification and spam filtering. Here we propose two novel label flipping attacks to evaluate the robustness of NB under label noise. For the three datasets of Spambase, TREC 2006c and TREC 2007 in the spam classification domain, our attack goal is to increase the false negative rate of NB under the influence of label noise without affecting normal mail classification. Our evaluation shows that at a noise level of 20%, the false negative rate of Spambase and TREC 2006c has increased by about 20%, and the test error of the TREC 2007 dataset has increased to nearly 30%. We compared the classification accuracy of five classic machine learning algorithms (random forest(RF), support vector machine(SVM), decision tree(DT), logistic regression(LR), and NB) and two deep learning models(AlexNet, LeNet) under the proposed label flipping attacks. The experimental results show that two label noises are suitable for various classification models and effectively reduce the accuracy of the models.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据