4.4 Article

A contextual approach towards more accurate duplicate bug report detection and ranking

Journal

EMPIRICAL SOFTWARE ENGINEERING
Volume 21, Issue 2, Pages 368-410

Publisher

SPRINGER
DOI: 10.1007/s10664-015-9387-3

Keywords

Issue-tracking systems; Bug-tracing systems; Duplicate bug reports; Triaging; Bug deduplication; Information retrieval; Software context

Funding

  1. NSERC Discovery Grants

Ask authors/readers for more resources

The issue-tracking systems used by software projects contain issues, bugs, or tickets written by a wide variety of bug reporters, with different levels of training and knowledge about the system under development. Typically, reporters lack the skills and/or time to search the issue-tracking system for similar issues already reported. As a result, many reports end up referring to the same issue, which effectively makes the bug-report triaging process time consuming and error prone. Many researchers have approached the bug-deduplication problem using off-the-shelf information-retrieval (IR) tools. In this work, we extend the state of the art by investigating how contextual information about software-quality attributes, software-architecture terms, and system-development topics can be exploited to improve bug deduplication. We demonstrate the effectiveness of our contextual bug-deduplication method at ranking duplicates on the bug repositories of the Android, Eclipse, Mozilla, and OpenOffice software systems. Based on this experience, we conclude that taking into account domain-specific context can improve IR methods for bug deduplication.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available