4.6 Article

FAMD: A Fast Multifeature Android Malware Detection Framework, Design, and Implementation

Journal

IEEE ACCESS
Volume 8, Issue -, Pages 194729-194740

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2020.3033026

Keywords

Malware; Feature extraction; Static analysis; Security; Support vector machines; Machine learning; Virtual machining; Android malware; CatBoost; Dalvik opcode; malware detection

Funding

  1. 13th Five-Year Science and Technology Research Project of the Education Department of Jilin Province [JJKH20200794KJ]
  2. Innovation Fund of Changchun University of Science and Technology [XJJLG-2018-09]
  3. Key Laboratory of Symbolic Computation and Knowledge Engineering, Ministry of Education, Jilin University [93K172018K05]

Ask authors/readers for more resources

With Androids dominant position within the current smartphone OS, increasing number of malware applications pose a great threat to user privacy and security. Classification algorithms that use a single feature usually have weak detection performance. Although the use of multiple features can improve the detection effect, increasing the number of features increases the requirements of the operating environment and consumes more time. We propose a fast Android malware detection framework based on the combination of multiple features: FAMD (Fast Android Malware Detector). First, we extracted permissions and Dalvik opcode sequences from samples to construct the original feature set. Second, the Dalvik opcodes are preprocessed with the N-Gram technique, and the FCBF (Fast Correlation-Based Filter) algorithm based on symmetrical uncertainty is employed to reduce feature dimensionality. Finally, the dimensionality-reduced features are input into the CatBoost classifier for malware detection and family classification. The dataset DS-1, which we collected, and the baseline dataset Drebin were used in the experiment. The results show that the combined features can effectively improve the detection accuracy of malware that can reach 97.40% on Drebin dataset, and the malware family classification accuracy can achieve 97.38% Compared with other state-of-the-art works, our framework achieves higher accuracy and lower time consumption.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available