4.6 Article

Embedding vector generation based on function call graph for effective malware detection and classification

期刊

NEURAL COMPUTING & APPLICATIONS
卷 34, 期 11, 页码 8643-8656

出版社

SPRINGER LONDON LTD
DOI: 10.1007/s00521-021-06808-8

关键词

Malware detection; Malware classification; Function call graph; Graph embedding; Attention mechanism

向作者/读者索取更多资源

The surge of malware poses a huge threat to cyberspace security. Existing malware analysis methods rely on feature engineering, increasing the complexity of analysis. This research proposes a new method based on function call graph and graph embedding network, which automatically extracts semantic features for efficient malware analysis.
The surge of malware poses a huge threat to cyberspace security. The existing malware analysis methods based on machine learning mainly rely on feature engineering. These methods need to extract many handcrafted features from the malware to improve accuracy, which increases the complexity of malware analysis. In order to solve this problem, this paper proposes GEMAL, a new malware analysis method based on function call graph (FCG) and graph embedding network. FCG contains the structure information of the binary file and has been used in various research of malware analysis. Inspired by natural language processing tasks, we treat instructions as words and functions as sentences, so that we can automatically extract semantic features using the natural language processing method. We use an attention mechanism based graph embedding network to combine structural features and semantic features to generate embedding vectors of malware for automatic and efficient malware analysis. We use two datasets to test the efficiency of GEMAL. One is a self-created dataset named WUFCG, which contains 70,188 real-world samples. The other one is the public dataset of the Microsoft Malware Classification Challenge, which contains 10,868 samples. Experimental results show that GEMAL can detect real-world malware with 99.16% accuracy and classify malware families with the best accuracy of 99.81%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据