4.6 Article

Machine learning models for phishing detection from TLS traffic

Publisher

SPRINGER
DOI: 10.1007/s10586-023-04042-6

Keywords

Phishing; URL; Anti-phishing; ML; TLS 1.2; TLS 1.3

Ask authors/readers for more resources

This paper proposes a novel machine learning-based technique to detect phishing URLs in encrypted traffic at the transport layer using TLS 1.2 and TLS 1.3. The proposed model extracts features from the encrypted traffic and performs phishing detection using ML algorithms. Experimental results show that the model effectively detects phishing URLs, achieving an accuracy of 95.40% for Light GBM (LGBM), the highest among the tested algorithms.
Phishing is a fraudulent tactic for attackers to obtain victims personal information, such as passwords, account details, credit card details, and other sensitive information. Existing anti-phishing detection methods using at the application layer and cannot be applied at the transport layer. A novel machine learning (ML) based phishing detection technique from transport layer security (TLS) 1.2 and TLS 1.3 encrypted traffic without decryption is proposed in this paper. Our proposed model detects phishing URLs at the transport layer and classifies them as legitimate or phishing. The features are extracted from TLS 1.2 and TLS 1.3 traffic, and phishing detection is performed using ML algorithms based on the extracted features. The datasets for legitimate and phishing sites are created using features derived from TLS 1.2 and TLS 1.3 traffic. According to the experimental results, the proposed model effectively detects phishing URLs in encrypted traffic. The proposed model achieves an accuracy of 93.63% for Random Forest (RF), 95.07% for XGBoost (XGB), and the highest accuracy of 95.40% for Light GBM (LGBM).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available