
CNN-Fusion: An effective and lightweight phishing detection method based on multi-variant ConvNet

INFORMATION SCIENCES (2023)

Journal

INFORMATION SCIENCES

Volume 631, Pages 328-345

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2023.02.039

Keywords

Phishing detection; Deep learning; Convolutional neural network; Phishing attacks; Malicious websites

Category

Computer Science, Information Systems

Summary

Phishing scams are on the rise and require rapid, precise, and low-cost prevention measures. CNN-Fusion, a character-level convolutional neural network, is proposed as an effective and lightweight method for detecting phishing URLs. It runs several one-layer CNN variants with different kernel sizes in parallel and applies SpatialDropout1D and max-over-time pooling to improve robustness and feature selection. Evaluation on publicly available datasets and against AI adversarial attacks shows better performance than existing methods with significantly reduced training time and memory consumption, achieving an average accuracy above 99%.

Abstract

Phishing scams are increasing as the technical skill and cost required to launch phishing attacks fall, emphasizing the need for rapid, precise, and low-cost prevention measures. Based on a character-level convolutional neural network (CNN), we present CNN-Fusion, an effective and lightweight phishing URL detection method. Our basic idea is to deploy multiple variants of a one-layer CNN with kernels of various sizes in parallel to extract multi-level features. Observing that the differences between phishing and benign URLs might exhibit a strong spatial correlation, we apply SpatialDropout1D, which makes the model more robust and prevents it from memorizing the training data. To further reduce the probability of errors that may be caused by irrelevant or noisy features, we apply max-over-time pooling over the feature map to pick only the most important feature. Finally, the model is evaluated on five publicly available datasets containing 1.85 million phishing and benign URLs. In addition, we assess the model against AI adversarial attacks, known as Offensive AI. Compared to existing methods, experiments demonstrate that our approach requires 5 times less training time and yields even greater savings in memory consumption, while achieving an average accuracy above 99% on five different datasets as well as on AI-generated malicious attacks.
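The pipeline described in the abstract (character-level input, spatial dropout, parallel one-layer convolutions with different kernel sizes, max-over-time pooling, then fusion) can be illustrated with a small dependency-free forward pass. This is only a sketch under stated assumptions, not the authors' implementation: the alphabet, the kernel sizes (3, 4, 5), the filter count, the embedding size, the random weights, and all function names are hypothetical choices made for illustration, and no classifier head or training loop is included.

```python
import random

# Assumed character vocabulary for illustration; id 0 is reserved for padding/unknown.
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789-._~:/?#[]@!$&'()*+,;=%"
CHAR_TO_ID = {ch: i + 1 for i, ch in enumerate(ALPHABET)}


def encode_url(url, max_len=32):
    """Map a URL to a fixed-length list of character ids (truncate or zero-pad)."""
    ids = [CHAR_TO_ID.get(ch, 0) for ch in url.lower()[:max_len]]
    return ids + [0] * (max_len - len(ids))


def spatial_dropout_1d(seq, rate, rng):
    """Drop whole embedding channels across every time step (training time only)."""
    dim = len(seq[0])
    # One keep/drop decision per channel; kept channels are rescaled by 1 / (1 - rate).
    scale = [0.0 if rng.random() < rate else 1.0 / (1.0 - rate) for _ in range(dim)]
    return [[v * s for v, s in zip(step, scale)] for step in seq]


def conv1d_relu(seq, filters):
    """Valid 1D convolution followed by ReLU; filters[f][k][c] is a single weight."""
    width, dim = len(filters[0]), len(seq[0])
    out = []
    for t in range(len(seq) - width + 1):
        row = []
        for filt in filters:
            s = sum(filt[k][c] * seq[t + k][c] for k in range(width) for c in range(dim))
            row.append(max(0.0, s))
        out.append(row)
    return out  # shape: (time - width + 1, n_filters)


def max_over_time(feature_map):
    """Keep only the strongest activation of each filter across all positions."""
    return [max(column) for column in zip(*feature_map)]


def fused_features(url, kernel_sizes=(3, 4, 5), n_filters=4, dim=8, drop_rate=0.0, seed=0):
    """Concatenate the pooled outputs of one single-layer CNN branch per kernel size."""
    rng = random.Random(seed)
    # Random weights stand in for learned parameters.
    table = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(len(ALPHABET) + 1)]
    seq = [table[i] for i in encode_url(url)]
    if drop_rate > 0.0:
        seq = spatial_dropout_1d(seq, drop_rate, rng)
    fused = []
    for width in kernel_sizes:
        filters = [[[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(width)]
                   for _ in range(n_filters)]
        fused.extend(max_over_time(conv1d_relu(seq, filters)))
    return fused
```

Calling `fused_features("http://example.com/login")` returns one value per filter per branch, so the vector length is `len(kernel_sizes) * n_filters` regardless of URL length. In a framework such as Keras, the same steps would likely map onto the `Embedding`, `SpatialDropout1D`, `Conv1D`, `GlobalMaxPooling1D`, and `Concatenate` layers, followed by a dense classifier; the hyperparameters the paper actually uses are not stated in this abstract.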
