4.1 Article

Locally weighted regression with different kernel smoothers for software effort estimation

期刊

SCIENCE OF COMPUTER PROGRAMMING
卷 214, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.scico.2021.102744

关键词

Effort estimation; Locally weighted regression; Kernel function; k-nearest neighbors

资金

  1. Applied Science Private University, Amman, Jordan
  2. Princess Sumaya University for Technology, Amman, Jordan

向作者/读者索取更多资源

Estimating software effort has been a challenge, and this paper introduces a more sophisticated locality approach called Locally Weighted Regression (LWR) to learn from local data and build estimation models with multiple local regression models.
Estimating software effort has been a largely unsolved problem for decades. One of the main reasons that hinders building accurate estimation models is the often heterogeneous nature of software data with a complex structure. Typically, building effort estimation models from local data tend to be more accurate than using the entire data. Previous studies have focused on the use of clustering techniques and decision trees to generate local and coherent data that can help in building local prediction models. However, these approaches may fall short in some aspect due to limitations in finding optimal clusters and processing noisy data. In this paper we used a more sophisticated locality approach that can mitigate these shortcomings that is Locally Weighted Regression (LWR). This method provides an efficient solution to learn from local data by building an estimation model that combines multiple local regression models in k-nearest-neighbor based model. The main factor affecting the accuracy of this method is the choice of the kernel function used to derive the weights for local regression models. This paper investigates the effects of choosing different kernels on the performance of Locally Weighted Regression of a software effort estimation problem. After comprehensive experiments with 7 datasets, 10 kernels, 3 polynomial degrees and 4 bandwidth values with a total of 840 Locally Weighted Regression variants, we found that: 1) Uniform kernel functions cannot outperform non-uniform kernel functions, and 2) kernel type, polynomial degrees and bandwidth parameters have no specific effect on the estimation accuracy. In other words, no change in bandwidth or degree values occurred with a significant difference in kernel rankings. In short, Locally Weighted Regression methods with Triweight or Triangle kernel can perform better than more complex kernels. Hence, we encourage non-uniform kernel methods as smoother function with wide bandwidth and small polynomial degree. (C) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据