☆ 4.2 Article

PDRCNN: Precise Phishing Detection with Recurrent Convolutional Neural Networks

SECURITY AND COMMUNICATION NETWORKS (2019)

期刊

SECURITY AND COMMUNICATION NETWORKS

卷 2019, 期 -, 页码 -

出版社

WILEY-HINDAWI

DOI: 10.1155/2019/2595794

关键词

类别

Computer Science, Information Systems Telecommunications

资金

National Natural Science Foundation of China [61672543, 61772559]
Open Research Fund of Hunan Provincial Key Laboratory of Network Investigational Technology [2017WLZC002, 2017WLZC003]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Through well-designed counterfeit websites, phishing induces online users to visit forged web pages to obtain their private sensitive information, e.g., account number and password. Existing antiphishing approaches are mostly based on page-related features, which require to crawl content of web pages as well as accessing third-party search engines or DNS services. This not only leads to their low efficiency in detecting phishing but also makes them rely on network environment and third-party services heavily. In this paper, we propose a fast phishing website detection approach called PDRCNN that relies only on the URL of the website. PDRCNN neither needs to retrieve content of the target website nor uses any third-party services as previous approaches do. It encodes the information of an URL into a two-dimensional tensor and feeds the tensor into a novelly designed deep learning neural network to classify the original URL. We first use a bidirectional LSTM network to extract global features of the constructed tensor and give all string information to each character in the URL. After that, we use a CNN to automatically judge which characters play key roles in phishing detection, capture the key components of the URL, and compress the extracted features into a fixed length vector space. By combining the two types of networks, PDRCNN achieves better performance than just using either one of them. We built a dataset containing nearly 500,000 URLs which are obtained through Alexa and PhishTank. Experimental results show that PDRCNN achieves a detection accuracy of 97% and an AUC value of 99%, which is much better than state-of-the-art approaches. Furthermore, the recognition process is very fast: on the trained PDRCNN model, the average per URL detection time only cost 0.4 ms.

PDRCNN: Precise Phishing Detection with Recurrent Convolutional Neural Networks

期刊

SECURITY AND COMMUNICATION NETWORKS

出版社

WILEY-HINDAWI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

PDRCNN: Precise Phishing Detection with Recurrent Convolutional Neural Networks

期刊

SECURITY AND COMMUNICATION NETWORKS

出版社

WILEY-HINDAWI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文