Journal
EMPIRICAL SOFTWARE ENGINEERING
Volume 21, Issue 2, Pages 368-410Publisher
SPRINGER
DOI: 10.1007/s10664-015-9387-3
Keywords
Issue-tracking systems; Bug-tracing systems; Duplicate bug reports; Triaging; Bug deduplication; Information retrieval; Software context
Categories
Funding
- NSERC Discovery Grants
Ask authors/readers for more resources
The issue-tracking systems used by software projects contain issues, bugs, or tickets written by a wide variety of bug reporters, with different levels of training and knowledge about the system under development. Typically, reporters lack the skills and/or time to search the issue-tracking system for similar issues already reported. As a result, many reports end up referring to the same issue, which effectively makes the bug-report triaging process time consuming and error prone. Many researchers have approached the bug-deduplication problem using off-the-shelf information-retrieval (IR) tools. In this work, we extend the state of the art by investigating how contextual information about software-quality attributes, software-architecture terms, and system-development topics can be exploited to improve bug deduplication. We demonstrate the effectiveness of our contextual bug-deduplication method at ranking duplicates on the bug repositories of the Android, Eclipse, Mozilla, and OpenOffice software systems. Based on this experience, we conclude that taking into account domain-specific context can improve IR methods for bug deduplication.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available