4.6 Article

DEDUCE: A pattern matching method for automatic de-identification of Dutch medical text

期刊

TELEMATICS AND INFORMATICS
卷 35, 期 4, 页码 727-736

出版社

ELSEVIER
DOI: 10.1016/j.tele.2017.08.002

关键词

De-identification; Dutch medical text; Pattern matching; Protected Health Information; Patient privacy

向作者/读者索取更多资源

In order to use medical text for research purposes, it is necessary to de-identify the text for legal and privacy reasons. We report on a pattern matching method to automatically de-identify medical text written in Dutch, which requires a low amount of effort to be hand tailored. First, a selection of Protected Health Information (PHI) categories is determined in cooperation with medical staff. Then, we devise a method for de-identifying all information in one of these PHI categories, that relies on lookup tables, decision rules and fuzzy string matching. Our de-identification method DEDUCE is validated on a test corpus of 200 nursing notes and 200 treatment plans obtained from the University Medical Center Utrecht (UMCU) in the Netherlands, achieving a total micro-averaged precision of 0.814, a recall of 0.916 and a F-1-score of 0.862. For person names, a recall of 0.964 was achieved, while no names of patients were missed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据