Article

Tackling the term-mismatch problem in automated trace retrieval

EMPIRICAL SOFTWARE ENGINEERING (2017)

Journal

EMPIRICAL SOFTWARE ENGINEERING

Volume 22, Issue 3, Pages 1103-1142

Publisher

SPRINGER

DOI: 10.1007/s10664-016-9479-8

Keywords

Requirements engineering; Traceability; Query augmentation; Semantic traceability

Category

Computer Science, Software Engineering

Funding

US National Science Foundation [CCF-1319680, CCF-0447594]
Directorate for Computer and Information Science and Engineering, Division of Computer and Network Systems [1649008]. Funding source: National Science Foundation
Directorate for Computer and Information Science and Engineering, Division of Computing and Communication Foundations [1319680, 1649448]. Funding source: National Science Foundation

Abstract

Software systems operating in safety- or security-critical domains must comply with an increasingly large and complex set of regulatory standards. Compliance is partially demonstrated by establishing trace links between requirements and regulatory codes. Such links can be constructed manually or through semi-automated techniques in which the text of the regulatory code is used to formulate an information retrieval query. However, trace retrieval solutions are not effective when significant vocabulary mismatches exist between regulatory codes and product-level requirements. This paper describes and compares three query augmentation techniques for addressing the term-mismatch problem and improving the quality of trace links generated between regulatory codes and requirements. The first trains a classifier to replace the original query with terms learned from a training set of regulation-to-requirements trace links. The second replaces the original query with terms learned through web mining, and the third uses a domain ontology to augment query terms. The ontology is constructed manually using a guided approach that leverages existing traceability knowledge. All three techniques were evaluated against security regulations from the US government's Health Insurance Portability and Accountability Act (HIPAA), traced against ten healthcare-related requirements specifications. The classification approach returned the best results; however, improvements were observed with both the classification and ontology-based solutions. The web-mining technique showed improvements in only a subset of queries. The three query augmentation techniques offer tradeoffs in terms of performance, cost and effort, and usage viability within a specific project context.
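
To make the term-mismatch problem concrete, the following is a minimal sketch of ontology-based query augmentation. It is not the paper's implementation: the toy `ONTOLOGY` mapping, the function names, and the plain term-frequency cosine score are all assumptions chosen for illustration, and a real trace retrieval setup would likely use a curated domain ontology and a stronger weighting scheme.

```python
import math
import re
from collections import Counter

# Hypothetical toy ontology: regulatory term -> related requirement-level terms.
ONTOLOGY = {
    "audit": ["log", "record"],
    "authentication": ["login", "password"],
}

def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())

def augment(query_terms, ontology):
    """Keep the original query terms and append their ontology neighbours."""
    expanded = list(query_terms)
    for term in query_terms:
        expanded.extend(ontology.get(term, []))
    return expanded

def cosine(a, b):
    """Cosine similarity between two token lists using raw term counts."""
    va, vb = Counter(a), Counter(b)
    dot = sum(va[t] * vb[t] for t in va)
    norm = math.sqrt(sum(v * v for v in va.values())) * math.sqrt(
        sum(v * v for v in vb.values())
    )
    return dot / norm if norm else 0.0

def rank(query, requirements, ontology=None):
    """Score each requirement against the (optionally augmented) query."""
    terms = tokenize(query)
    if ontology is not None:
        terms = augment(terms, ontology)
    scored = [(cosine(terms, tokenize(r)), r) for r in requirements]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

# Illustrative data only: a regulation-style query and two requirements.
regulation = "Implement audit controls"
requirements = [
    "The system shall log every record access",
    "The system shall display a patient calendar",
]
baseline = rank(regulation, requirements)
augmented = rank(regulation, requirements, ONTOLOGY)
```

In this toy case the regulation and the relevant requirement share no words, so the unaugmented query cannot separate the two requirements, while the augmented query gives the logging requirement a nonzero score through the added terms "log" and "record".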
