☆ 4.4 Article

Estimating the number of remaining links in traceability recovery

EMPIRICAL SOFTWARE ENGINEERING (2017)

Journal

EMPIRICAL SOFTWARE ENGINEERING

Volume 22, Issue 3, Pages 996-1027

Publisher

SPRINGER

DOI: 10.1007/s10664-016-9460-6

Keywords

Information retrieval; Traceability link recovery; Metrics and measurement

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Although very important in software engineering, establishing traceability links between software artifacts is extremely tedious, error-prone, and it requires significant effort. Even when approaches for automated traceability recovery exist, these provide the requirements analyst with a, usually very long, ranked list of candidate links that needs to be manually inspected. In this paper we introduce an approach called Estimation of the Number of Remaining Links (ENRL) which aims at estimating, via Machine Learning (ML) classifiers, the number of remaining positive links in a ranked list of candidate traceability links produced by a Natural Language Processing techniques-based recovery approach. We have evaluated the accuracy of the ENRL approach by considering several ML classifiers and NLP techniques on three datasets from industry and academia, and concerning traceability links among different kinds of software artifacts including requirements, use cases, design documents, source code, and test cases. Results from our study indicate that: (i) specific estimation models are able to provide accurate estimates of the number of remaining positive links; (ii) the estimation accuracy depends on the choice of the NLP technique, and (iii) univariate estimation models outperform multivariate ones.

Estimating the number of remaining links in traceability recovery

Journal

EMPIRICAL SOFTWARE ENGINEERING

Publisher

SPRINGER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Estimating the number of remaining links in traceability recovery

Journal

EMPIRICAL SOFTWARE ENGINEERING

Publisher

SPRINGER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper