Article

Cost-effective ensemble models selection using deep reinforcement learning

Journal

INFORMATION FUSION
Volume 77, Pages 133-148

Publisher

ELSEVIER
DOI: 10.1016/j.inffus.2021.07.011

Keywords

Malware detection; Reinforcement learning; Transfer learning; Portable executable; Android package

Ensemble learning is a common technique that applies multiple learning models to the same task to improve classification accuracy. SPIREL is a novel cost-effective classification method that dynamically assigns different learning models and classification thresholds, showing high cost-effectiveness in large malware datasets.
Ensemble learning, the application of multiple learning models to the same task, is a common technique in many domains. While employing multiple models makes higher classification accuracy attainable, the process can be time-consuming and costly, and it can make scaling more difficult. Given that each model may have different capabilities and costs, assigning the most cost-effective set of learners to each sample is challenging. We propose SPIREL, a novel method for cost-effective classification. Our method enables users to directly associate costs with correct/incorrect label assignment, computing resources, and run-time, and then dynamically establishes a classification policy. For each analyzed sample, SPIREL dynamically assigns a different set of learning models, as well as its own classification threshold. Extensive evaluation on two large malware datasets (a domain in which the application of multiple analysis tools is common) demonstrates that SPIREL is highly cost-effective, enabling us to reduce running time by approximately 80% while decreasing the accuracy and F1-score by only 0.5%. We also show that our approach is both highly transferable across different datasets and adaptable to changes in individual learning model performance.
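
The trade-off described in the abstract, paying run-time for each extra model versus the cost of a wrong label, can be illustrated with a small sketch. This is not SPIREL itself: the paper learns its policy with deep reinforcement learning, whereas the code below substitutes a fixed, hand-written stopping rule. The `Detector` class, the `classify` and `reward` functions, and all cost and reward values are hypothetical names and numbers chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Detector:
    """A hypothetical ensemble member with an assumed per-call cost."""
    name: str
    cost: float                                   # assumed run-time cost of one call
    score: Callable[[Dict[str, float]], float]    # returns P(malicious) in [0, 1]


def classify(sample: Dict[str, float],
             detectors: List[Detector],
             confidence: float = 0.9) -> Tuple[bool, float]:
    """Query detectors cheapest-first and stop once the mean score is confident.

    This fixed heuristic stands in for a learned policy; it only shows how
    a per-sample subset of models can be chosen at classification time.
    """
    if not detectors:
        raise ValueError("at least one detector is required")
    scores: List[float] = []
    spent = 0.0
    for det in sorted(detectors, key=lambda d: d.cost):
        scores.append(det.score(sample))
        spent += det.cost
        mean = sum(scores) / len(scores)
        # Stop early when the running mean is far enough from 0.5.
        if mean >= confidence or mean <= 1.0 - confidence:
            break
    mean = sum(scores) / len(scores)
    return mean >= 0.5, spent


def reward(predicted: bool, actual: bool, spent: float,
           r_correct: float = 1.0, r_wrong: float = -10.0) -> float:
    """User-assigned label reward minus the resources spent (assumed values)."""
    return (r_correct if predicted == actual else r_wrong) - spent


if __name__ == "__main__":
    detectors = [
        Detector("fast_static", 0.1, lambda s: s["fast"]),
        Detector("slow_dynamic", 1.0, lambda s: s["slow"]),
    ]
    # A clear-cut sample is decided by the cheap detector alone.
    print(classify({"fast": 0.95, "slow": 0.90}, detectors))
    # An ambiguous sample triggers the expensive detector as well.
    print(classify({"fast": 0.60, "slow": 0.20}, detectors))
```

In this sketch the cost of a misclassification (`r_wrong`) and the per-detector `cost` play the role of the user-defined costs mentioned in the abstract; a learned policy would choose which detector to call next, and when to stop, so as to maximize the expected value of `reward`.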
