Article

A Hitchhiker's guide to statistical tests for assessing randomized algorithms in software engineering

SOFTWARE TESTING VERIFICATION & RELIABILITY (2014)

Journal

SOFTWARE TESTING VERIFICATION & RELIABILITY

Volume 24, Issue 3, Pages 219-250

Publisher

WILEY

DOI: 10.1002/stvr.1486

Keywords

statistical difference; effect size; parametric test; nonparametric test; confidence interval; Bonferroni adjustment; systematic review; survey

Category

Computer Science, Software Engineering

Funding

Norwegian Research Council
FNR PEARL grant, Luxembourg

Abstract

Randomized algorithms are widely used to address many types of software engineering problems, especially in the area of software verification and validation with a strong emphasis on test automation. However, randomized algorithms are affected by chance and so require the use of appropriate statistical tests to be properly analysed in a sound manner. This paper features a systematic review regarding recent publications in 2009 and 2010 showing that, overall, empirical analyses involving randomized algorithms in software engineering tend to not properly account for the random nature of these algorithms. Many of the novel techniques presented clearly appear promising, but the lack of soundness in their empirical evaluations casts unfortunate doubts on their actual usefulness. In software engineering, although there are guidelines on how to carry out empirical analyses involving human subjects, those guidelines are not directly and fully applicable to randomized algorithms. Furthermore, many of the textbooks on statistical analysis are written from the viewpoints of social and natural sciences, which present different challenges from randomized algorithms. To address the questionable overall quality of the empirical analyses reported in the systematic review, this paper provides guidelines on how to carry out and properly analyse randomized algorithms applied to solve software engineering tasks, with a particular focus on software testing, which is by far the most frequent application area of randomized algorithms within software engineering. Copyright (c) 2012 John Wiley & Sons, Ltd.
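As a rough illustration of what accounting for randomness can look like in practice, the sketch below compares repeated runs of two randomized algorithms using a nonparametric effect size and a Bonferroni-adjusted significance level, both of which appear among the keywords. The abstract does not name a specific statistic, so the choice of the Vargha-Delaney A12 measure here is an assumption, and the function names and run data are hypothetical.

```python
from itertools import product


def a12(xs, ys):
    """Vargha-Delaney A12 effect size.

    Estimates the probability that a run drawn from xs yields a larger
    value than a run drawn from ys, counting ties as half a win.
    0.5 means no difference; values near 0 or 1 mean a large difference.
    """
    if not xs or not ys:
        raise ValueError("both samples must be non-empty")
    wins = sum(
        1.0 if x > y else 0.5 if x == y else 0.0
        for x, y in product(xs, ys)
    )
    return wins / (len(xs) * len(ys))


def bonferroni_alpha(alpha, num_comparisons):
    """Per-comparison significance level under a Bonferroni adjustment."""
    if num_comparisons < 1:
        raise ValueError("num_comparisons must be at least 1")
    return alpha / num_comparisons


# Hypothetical data: branch coverage (%) from 10 independent runs of two
# randomized test generators, each run started with a different seed.
runs_a = [71, 74, 69, 80, 77, 73, 75, 78, 72, 76]
runs_b = [65, 70, 68, 66, 72, 64, 69, 67, 71, 63]

effect = a12(runs_a, runs_b)
adjusted = bonferroni_alpha(0.05, num_comparisons=5)
print(f"A12 = {effect:.3f}, per-comparison alpha = {adjusted:.3f}")
```

The point of the sketch is that each algorithm is run many times and the two samples are compared as distributions, rather than reporting a single run or only the averages. A real analysis would typically pair the effect size with a significance test and use more runs than shown here.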
