4.6 Article

Robust Malware Family Classification Using Effective Features and Classifiers

期刊

APPLIED SCIENCES-BASEL
卷 12, 期 15, 页码 -

出版社

MDPI
DOI: 10.3390/app12157877

关键词

malware classification; deep learning; Tamura; GoogLeNet; KNN; SVM; ELM

资金

  1. Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia [PNURSP2022R195]

向作者/读者索取更多资源

The development of malware poses a serious security risk, and existing classification techniques have low accuracy due to the difficulty in finding accurate features and dealing with data imbalance. A proposed method for malware classification achieves extremely high accuracy, surpassing traditional hand-crafted and deep feature techniques.
Malware development has significantly increased recently, posing a serious security risk to both consumers and businesses. Malware developers continually find new ways to circumvent security research's ongoing efforts to guard against malware attacks. Malware Classification (MC) entails labeling a class of malware to a specific sample, while malware detection merely entails finding malware without identifying which kind of malware it is. There are two main reasons why the most popular MC techniques have a low classification rate. First, Finding and developing accurate features requires highly specialized domain expertise. Second, a data imbalance that makes it challenging to classify and correctly identify malware. Furthermore, the proposed malware classification (MC) method consists of the following five steps: (i) Dataset preparation: 2D malware images are created from the malware binary files; (ii) Visualized Malware Pre-processing: the visual malware images need to be scaled to fit the CNN model's input size; (iii) Feature extraction: both hand-engineering (Tamura) and deep learning (GoogLeNet) techniques are used to extract the features in this step; (iv) Classification: to perform malware classification, we employed k-Nearest Neighbor (KNN), Support Vector Machines (SVM), and Extreme Learning Machine (ELM). The proposed method is tested on a standard Malimg unbalanced dataset. The accuracy rate of the proposed method was extremely high, making it the most efficient option available. The proposed method's accuracy rate was outperformed both the Hand-crafted feature and Deep Feature techniques, at 95.42 and 96.84 percent.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据