4.6 Article

A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine Learning

期刊

IEEE ACCESS
卷 11, 期 -, 页码 63579-63597

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2023.3287326

关键词

Defect prediction; accuracy; feature selection; machine learning

向作者/读者索取更多资源

In the software engineering community, defect prediction is an active domain that aims to reduce the gap between software engineering and data mining. Software defects prediction utilizes various methods such as clustering, statistical methods, and machine learning algorithms to identify potential errors in source code before testing. The main contribution of this research is the use of feature selection for the first time, enhancing the accuracy of machine learning classifiers in defect prediction. The objective is to improve the defect prediction accuracy in five NASA datasets by using feature selection techniques with machine learning algorithms, leading to higher accuracy compared to without feature selection.
In software engineering community, defect prediction is one the active domain. For the software's success, it is essential to reduce the software engineering and data-mining gap. Software defects prediction forecasts the source code errors before the testing phase. Methods for predicting software defects, such as clustering, statistical methods, mixed algorithms, metrics based on neural networks, black box testing, white box testing and machine learning are frequently used to explore the effect area in software. The main contribution of this research is the use of feature selection for the first time to increase the accuracy of machine learning classifiers in defects pre-diction. The objective of this study is to improve the defects prediction accuracy in five data sets of NASA namely; CM1, JM1, KC2, KC1, and PC1. These NASA data sets are open to public. In this research, the feature selection technique is use with machine-learning techniques; Random Forest, Logistic Regression, Multilayer Perceptron, Bayesian Net, Rule ZeroR, J48, Lazy IBK, Support Vector Machine, Neural Networks, and Decision Stump to achieve high defect prediction accuracy as compared to without feature selection (WOFS). The research workbench, a machine-learning tool called WEKA (Waikato Environment for Knowledge Analysis), is used to refine da-ta, preprocess data, and apply the mentioned classifiers. To assess statistical analyses, a mini tab statistical tool is used. The results of this study reveals that accuracy of defects prediction with feature selection (WFS) is improve in contrast with the accuracy of WOFS.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据