☆ 4.6 Article

An Effective Phishing Detection Model Based on Character Level Convolutional Neural Network from URL

ELECTRONICS (2020)

期刊

ELECTRONICS

卷 9, 期 9, 页码 -

出版社

MDPI

DOI: 10.3390/electronics9091514

关键词

phishing detection; URL features engineering; character embedding; deep learning

类别

Computer Science, Information Systems Engineering, Electrical & Electronic Physics, Applied

资金

Key-Area Research and Development Program of Guangdong Province [2019B010137002]
National Natural Science Foundation of China [61902385]
China Postdoctoral Science Foundation [2020M672892]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Phishing is the easiest way to use cybercrime with the aim of enticing people to give accurate information such as account IDs, bank details, and passwords. This type of cyberattack is usually triggered by emails, instant messages, or phone calls. The existing anti-phishing techniques are mainly based on source code features, which require to scrape the content of web pages, and on third-party services which retard the classification process of phishing URLs. Although the machine learning techniques have lately been used to detect phishing, they require essential manual feature engineering and are not an expert at detecting emerging phishing offenses. Due to the recent rapid development of deep learning techniques, many deep learning-based methods have also been introduced to enhance the classification performance. In this paper, a fast deep learning-based solution model, which uses character-level convolutional neural network (CNN) for phishing detection based on the URL of the website, is proposed. The proposed model does not require the retrieval of target website content or the use of any third-party services. It captures information and sequential patterns of URL strings without requiring a prior knowledge about phishing, and then uses the sequential pattern features for fast classification of the actual URL. For evaluations, comparisons are provided between different traditional machine learning models and deep learning models using various feature sets such as hand-crafted, character embedding, character level TF-IDF, and character level count vectors features. According to the experiments, the proposed model achieved an accuracy of 95.02% on our dataset and an accuracy of 98.58%, 95.46%, and 95.22% on benchmark datasets which outperform the existing phishing URL models.

An Effective Phishing Detection Model Based on Character Level Convolutional Neural Network from URL

期刊

ELECTRONICS

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

An Effective Phishing Detection Model Based on Character Level Convolutional Neural Network from URL

期刊

ELECTRONICS

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文