4.0 Article

A software package for the application of probabilistic anonymisation to sensitive individual-level data: a proof of principle with an example from the ALSPAC birth cohort study

期刊

LONGITUDINAL AND LIFE COURSE STUDIES
卷 9, 期 4, 页码 433-446

出版社

SOC LONGITUDINAL & LIFE COURSE STUDIES
DOI: 10.14301/llcs.v9i4.478

关键词

Probabilistic anonymisation; disclosure control; measurement error; h-rank index; ALSPAC

资金

  1. Wellcome Trust [102215/2/13/2]
  2. UK Economic and Social Research Council and Medical Research Council [ES/K000357/1]
  3. Department of Health and Social Care
  4. ESRC [ES/K000357/1] Funding Source: UKRI
  5. MRC [MR/K006525/1, MR/K007017/1] Funding Source: UKRI

向作者/读者索取更多资源

Individual-level data require protection from unauthorised access to safeguard confidentiality and security of sensitive information. Risks of disclosure are evaluated through privacy risk assessments and are controlled or minimised before data sharing and integration. The evolution from 'Micro Data Laboratory' traditions (i.e. access in controlled physical locations) to 'Open Data' (i.e. sharing individual-level data) drives the development of efficient anonymisation methods and protection controls. Effective anonymisation techniques should increase the uncertainty surrounding re-identification while retaining data utility, allowing informative data analysis. 'Probabilistic anonymisation' is one such technique, which alters the data by addition of random noise. In this paper, we describe the implementation of one probabilistic anonymisation technique into an operational software written in R and we demonstrate its applicability through application to analysis of asthma-related data from the ALSPAC cohort study. The software is designed to be used by data managers and users without the requirement of advanced statistical knowledge.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据