4.7 Article

Intercepting Hail Hydra: Real-time detection of Algorithmically Generated Domains

期刊

出版社

ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
DOI: 10.1016/j.jnca.2021.103135

关键词

Malware; Domain Generation Algorithms; Botnets; DNS; Algorithmically Generated Domain

资金

  1. European Commission [830929, 832735, 780498]
  2. H2020 ECSEL project VALU3S [876852]
  3. Brno University of Technology [FITS206427]

向作者/读者索取更多资源

The study examines the technical challenges faced by cybercriminals in controlling botnets and the use of Domain Generation Algorithms to evade take-down attempts. The HYDRAS dataset is introduced as the most comprehensive and representative dataset, highlighting different families and variants. Results show that the proposed approach significantly outperforms the current state-of-the-art in terms of both classification performance and efficiency.
A crucial technical challenge for cybercriminals is to keep control over the potentially millions of infected devices that build up their botnets, without compromising the robustness of their attacks. A single, fixed C&C server, for example, can be trivially detected either by binary or traffic analysis and immediately sink-holed or taken-down by security researchers or law enforcement. Botnets often use Domain Generation Algorithms (DGAs), primarily to evade take-down attempts. DGAs can enlarge the lifespan of a malware campaign, thus potentially enhancing its profitability. They can also contribute to hindering attack accountability. In this work, we introduce HYDRAS, the most comprehensive and representative dataset of AlgorithmicallyGenerated Domains (AGD) available to date. The dataset contains more than 100 DGA families, including both real-world and adversarially designed ones. We analyse the dataset and discuss the possibility of differentiating between benign requests (to real domains) and malicious ones (to AGDs) in real-time. The simultaneous study of so many families and variants introduces several challenges; nonetheless, it alleviates biases found in previous literature employing small datasets which are frequently overfitted, exploiting characteristic features of particular families that do not generalise well. We thoroughly compare our approach with the current state-of-the-art and highlight some methodological shortcomings in the actual state of practice. The outcomes obtained show that our proposed approach significantly outperforms the current state-of-the-art in terms of both classification performance and efficiency.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据