4.0 Article

Understanding Cybersecurity Threat Trends Through Dynamic Topic Modeling

期刊

FRONTIERS IN BIG DATA
卷 4, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fdata.2021.601529

关键词

cybersecurity; cyberthreat information; knowledge graph; topic modeling; dynamic topic modeling

资金

  1. Hackerman Foundation

向作者/读者索取更多资源

Cybersecurity threats are on the rise and understanding the changing vulnerabilities can help combat new threats. Analyzing cybersecurity document collections through dynamic topic modeling reveals the importance of evolving concepts. Integrating different temporal corpora and representing data in a semantic knowledge graph supports integration, inference, and discovery, enhancing the quality of models.
Cybersecurity threats continue to increase and are impacting almost all aspects of modern life. Being aware of how vulnerabilities and their exploits are changing gives helpful insights into combating new threats. Applying dynamic topic modeling to a time-stamped cybersecurity document collection shows how the significance and details of concepts found in them are evolving. We correlate two different temporal corpora, one with reports about specific exploits and the other with research-oriented papers on cybersecurity vulnerabilities and threats. We represent the documents, concepts, and dynamic topic modeling data in a semantic knowledge graph to support integration, inference, and discovery. A critical insight into discovering knowledge through topic modeling is seeding the knowledge graph with domain concepts to guide the modeling process. We use Wikipedia concepts to provide a basis for performing concept phrase extraction and show how using those phrases improves the quality of the topic models. Researchers can query the resulting knowledge graph to reveal important relations and trends. This work is novel because it uses topics as a bridge to relate documents across corpora over time.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据