Article

A new perspective on data homogeneity in software cost estimation: a study in the embedded systems domain

Journal

SOFTWARE QUALITY JOURNAL
Volume 18, Issue 1, Pages 57-80

Publisher

SPRINGER
DOI: 10.1007/s11219-009-9081-z

Keywords

Application domain; Cost estimation; Data homogeneity; Embedded software; Machine learning

Funding

  1. Bogazici University [BAP 06HA104]
  2. Tubitak [EEEAG 108E014]

Abstract

Cost estimation and effort allocation are key challenges for successful project planning and management in software development. Both industry and the research community have therefore been working on various models and techniques to predict project cost accurately. Recently, researchers have started debating whether prediction performance depends on the structure of the data rather than on the models used. In this article, we focus on a new aspect of data homogeneity, "cross- versus within-application domain", and investigate what kind of training data should be used for software cost estimation in the embedded systems domain. In addition, we examine the effect of training dataset size on prediction performance. Based on our empirical results, we conclude that it is better to use cross-domain data for embedded software cost estimation and that the optimum training data size depends on the method used.
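
The cross- versus within-domain comparison described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the project records are invented, "cross-domain" is read here as "projects from other application domains", the estimator is a simple average-productivity model rather than any of the paper's machine learning methods, and MMRE (mean magnitude of relative error) is used only because it is a common accuracy measure in cost estimation.

```python
from statistics import mean

# Hypothetical project records (invented values, for illustration only).
PROJECTS = [
    {"id": 1, "domain": "embedded", "size": 10, "effort": 50},
    {"id": 2, "domain": "embedded", "size": 20, "effort": 120},
    {"id": 3, "domain": "web", "size": 10, "effort": 20},
    {"id": 4, "domain": "web", "size": 30, "effort": 90},
    {"id": 5, "domain": "embedded", "size": 40, "effort": 200},
]


def split_training(projects, target_domain, test_ids):
    """Return (within, cross) training sets, excluding the test projects.

    Assumption: 'cross' means projects whose domain differs from the target.
    """
    pool = [p for p in projects if p["id"] not in test_ids]
    within = [p for p in pool if p["domain"] == target_domain]
    cross = [p for p in pool if p["domain"] != target_domain]
    return within, cross


def fit_productivity(train):
    """Toy estimator: average effort per unit of size over the training set."""
    return mean(p["effort"] / p["size"] for p in train)


def predict(rate, project):
    """Predicted effort for one project under the toy estimator."""
    return rate * project["size"]


def mmre(actual, predicted):
    """Mean magnitude of relative error over paired actual/predicted efforts."""
    return mean(abs(a - p) / a for a, p in zip(actual, predicted))


def evaluate(train, test):
    """Fit on `train`, predict `test`, and return the MMRE."""
    rate = fit_productivity(train)
    return mmre([t["effort"] for t in test], [predict(rate, t) for t in test])


test_set = [p for p in PROJECTS if p["id"] == 5]
within_train, cross_train = split_training(PROJECTS, "embedded", {5})
within_error = evaluate(within_train, test_set)
cross_error = evaluate(cross_train, test_set)
```

The invented numbers say nothing about which training set is actually better; the sketch only shows the shape of the comparison. Studying training set size in the same way would mean repeating `evaluate` on subsets of each training set of increasing size.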
