☆ 4.6 Article

Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction

ENTROPY (2022)

期刊

ENTROPY

卷 24, 期 5, 页码 -

出版社

MDPI

DOI: 10.3390/e24050651

关键词

vulnerability prediction; dataset extension; software metrics; text mining; machine learning; deep learning; ensemble learning

类别

Physics, Multidisciplinary

资金

European Union [952684]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Software security is vital for software development organizations to provide high-quality and reliable software. Early detection of vulnerabilities is crucial, and text mining-based vulnerability prediction models have shown better performance compared to software metrics-based models.

Software security is a very important aspect for software development organizations who wish to provide high-quality and dependable software to their consumers. A crucial part of software security is the early detection of software vulnerabilities. Vulnerability prediction is a mechanism that facilitates the identification (and, in turn, the mitigation) of vulnerabilities early enough during the software development cycle. The scientific community has recently focused a lot of attention on developing Deep Learning models using text mining techniques for predicting the existence of vulnerabilities in software components. However, there are also studies that examine whether the utilization of statically extracted software metrics can lead to adequate Vulnerability Prediction Models. In this paper, both software metrics- and text mining-based Vulnerability Prediction Models are constructed and compared. A combination of software metrics and text tokens using deep-learning models is examined as well in order to investigate if a combined model can lead to more accurate vulnerability prediction. For the purposes of the present study, a vulnerability dataset containing vulnerabilities from real-world software products is utilized and extended. The results of our analysis indicate that text mining-based models outperform software metrics-based models with respect to their F-2-score, whereas enriching the text mining-based models with software metrics was not found to provide any added value to their predictive performance.

Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction

期刊

ENTROPY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction

期刊

ENTROPY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文