4.5 Article

Sumav: Fully automated malware labeling

Journal

ICT EXPRESS
Volume 8, Issue 4, Pages 530-538

Publisher

ELSEVIER
DOI: 10.1016/j.icte.2022.02.007

Keywords

Malware; Labeling; AV labels; Clustering; Classification

Funding

  1. Korea Institute of Energy Technology Evaluation and Planning (KETEP)
  2. Ministry of Trade, Industry & Energy (MOTIE) of the Republic of Korea [20212020800120]
  3. Korea Evaluation Institute of Industrial Technology (KEIT) [20212020800120] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

The study presents Sumav, an automated labeling tool that assigns family names to files based on antivirus labels. It can provide high-quality labeling performance even in cases of sudden changes to the antivirus label system.
Multiple AV engines are used to ensure more effective system protection against malicious files. These AV engines are capable of distinguishing between benign and malicious files, but even if a file of interest is proven to be malicious, it is still necessary to refer to a list of AV labels provided by each AV engine to determine what family name the malicious file belongs to. However, oftentimes, such AV labels lack a consistent naming scheme, and even family names differ from one AV engine to another. The present study presents Sumav, a fully automated labeling tool that assigns each file a family name based on AV labels. According to previous studies, such a task required prior knowledge or malicious file datasets that had already been labeled. In contrast, Sumav can assign family names with only the AV labels. This system also requires no maintenance and can provide high-quality labeling performance even if sudden changes have been made to the AV label system. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of The Korean Institute of Communications and Information Sciences.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available