4.6 Article

SeFACED: Semantic-Based Forensic Analysis and Classification of E-Mail Data Using Deep Learning

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 98398-98411

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3095730

Keywords

Electronic mail; Unsolicited e-mail; Postal services; Phishing; Support vector machines; Logic gates; Forensics; Artificial intelligence; cybercrimes; multiclass e-mail classification; deep learning; cybersecurity

Ask authors/readers for more resources

AIoT is an emerging trend in industrial applications that requires proactive data analysis to prevent cyber-attacks and crimes. SeFACED is an efficient method for multiclass email classification that overcomes limitations of traditional search techniques, achieving performance improvements on long dependency data.
Artificial Intelligence (AI), in combination with the Internet of Things (IoT), called (AIoT), an emerging trend in industrial applications, is capable of intelligent decision-making with self-driven analytics. With its extensive usage in diverse scenarios, IoT devices generate bulk data contrived by attackers to disrupt normal operations and services. Hence, there is a need for proactive data analysis to prevent cyber-attacks and crimes. To investigate crimes involving Electronic Mail (e-mail), analysis of both the header and the email body is required since the semantics of communication helps to identify the source of potential evidence. With the continued growth of data shared via emails, investigators now face the daunting challenge of extracting the required semantic information from the bulks of emails, thereby causing a delay in the investigation process. This gives an edge to the criminal in erasing their footprints of malicious acts. The existing keyword-based search techniques and filtration often result in extraneous, short sequence emails, which skips meaningful information. To overcome the above limitation, we propose a novel efficient approach named SeFACED that uses Long Short-Term Memory (LSTM) based Gated Recurrent Neural Network (GRU) for multiclass email classification. SeFACED not only works on short sequences but with long dependencies of 1000+ characters as well. SeFACED focuses on tuning LSTM based GRU parameters to attain the best performance and with assessment by comparing it with traditional machine learning, deep learning models, and state-of-the-art studies on the subject. Experimental results on self-extended benchmark datasets exhibit that SeFACED effectively outperforms existing methods while keeping the classification process robust and reliable.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available