☆ 4.4 Article

Studying just-in-time defect prediction using cross-project models

EMPIRICAL SOFTWARE ENGINEERING (2016)

Journal

EMPIRICAL SOFTWARE ENGINEERING

Volume 21, Issue 5, Pages 2072-2106

Publisher

SPRINGER

DOI: 10.1007/s10664-015-9400-x

Keywords

Empirical study; Defect prediction; Just-in-time prediction

Funding

JSPS [15H05306, 24680003]
Natural Sciences and Engineering Research Council of Canada (NSERC)
Grants-in-Aid for Scientific Research [15H05306, 25540026, 24680003] Funding Source: KAKEN

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Unlike traditional defect prediction models that identify defect-prone modules, Just-In-Time (JIT) defect prediction models identify defect-inducing changes. As such, JIT defect models can provide earlier feedback for developers, while design decisions are still fresh in their minds. Unfortunately, similar to traditional defect models, JIT models require a large amount of training data, which is not available when projects are in initial development phases. To address this limitation in traditional defect prediction, prior work has proposed cross-project models, i.e., models learned from other projects with sufficient history. However, cross-project models have not yet been explored in the context of JIT prediction. Therefore, in this study, we empirically evaluate the performance of JIT models in a cross-project context. Through an empirical study on 11 open source projects, we find that while JIT models rarely perform well in a cross-project context, their performance tends to improve when using approaches that: (1) select models trained using other projects that are similar to the testing project, (2) combine the data of several other projects to produce a larger pool of training data, and (3) combine the models of several other projects to produce an ensemble model. Our findings empirically confirm that JIT models learned using other projects are a viable solution for projects with limited historical data. However, JIT models tend to perform best in a cross-project context when the data used to learn them are carefully selected.

Studying just-in-time defect prediction using cross-project models

Journal

EMPIRICAL SOFTWARE ENGINEERING

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Studying just-in-time defect prediction using cross-project models

Journal

EMPIRICAL SOFTWARE ENGINEERING

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper