4.4 Article

Tackling the term-mismatch problem in automated trace retrieval

期刊

EMPIRICAL SOFTWARE ENGINEERING
卷 22, 期 3, 页码 1103-1142

出版社

SPRINGER
DOI: 10.1007/s10664-016-9479-8

关键词

Requirements engineering; Traceability; Query augmentation; Semantic traceability

资金

  1. US National Science Foundation [CCF-1319680, CCF-0447594]
  2. Direct For Computer & Info Scie & Enginr
  3. Division Of Computer and Network Systems [1649008] Funding Source: National Science Foundation
  4. Direct For Computer & Info Scie & Enginr
  5. Division of Computing and Communication Foundations [1319680, 1649448] Funding Source: National Science Foundation

向作者/读者索取更多资源

Software systems operating in any type of safety or security critical domains must comply with an increasingly large and complex set of regulatory standards. Compliance is partially demonstrated through establishing trace links between requirements and regulatory codes. Such links can be constructed manually or through semi-automated techniques in which the text in the regulatory code is used to formulate an information retrieval query. However, trace retrieval solutions are not effective when significant vocabulary mismatches exist between regulatory codes and product level requirements. This paper describes and compares three query augmentation techniques for addressing the term mismatch problem and improving the quality of trace links generated between regulatory codes and requirements. The first trains a classifier to replace the original query with terms learned from a training set of regulation-to-requirements trace links. The second, replaces the original query with terms learned through web-mining; and the third utilizes a domain ontology to augment query terms. The ontology is constructed manually using a guided approach that leverages existing traceability knowledge. All three techniques were evaluated against security regulations from the USA government's Health Insurance Privacy and Portability Act (HIPAA) traced against ten healthcare related requirements specifications. The classification approach returned the best results; however, improvements were observed with both the classification and ontology based solutions. The web-mining technique showed improvements in only a subset of queries. The three query augmentation techniques offer tradeoffs in terms of performance, cost and effort, and usage viability within a specific project context.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据