Journal
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY
Volume 45, Issue 3, Pages 954-964
Publisher
OXFORD UNIV PRESS
DOI: 10.1093/ije/dyv322
Keywords
Record linkage; epidemiological methods; medical record linkage; bias; data linkage
Funding
- MRC Fellowship [MR/L01226X/1]
- MRC [MR/L01226X/1] Funding Source: UKRI
- Medical Research Council [MR/L01226X/1] Funding Source: researchfish
- National Institute for Health Research [NF-SI-0515-10048] Funding Source: researchfish
Studies involving the use of probabilistic record linkage are becoming increasingly common. However, the methods underpinning probabilistic record linkage are not widely taught or understood, and therefore these studies can appear to be a 'black box' research tool. In this article, we aim to describe the process of probabilistic record linkage through a simple exemplar. We first introduce the concept of deterministic linkage and contrast this with probabilistic linkage. We illustrate each step of the process using a simple exemplar and describe the data structure required to perform a probabilistic linkage. We describe the process of calculating and interpreting matched weights and how to convert matched weights into posterior probabilities of a match using Bayes' theorem. We conclude this article with a brief discussion of some of the computational demands of record linkage, how you might assess the quality of your linkage algorithm, and how epidemiologists can maximize the value of their record-linked research using robust record linkage methods.
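As a rough illustration of the weight calculation and Bayes' theorem step mentioned in the abstract, the sketch below follows the commonly used Fellegi-Sunter formulation of probabilistic linkage. This formulation is an assumption here, since the abstract does not name a specific method. The field names, the m- and u-probabilities, the prior, the base-2 logarithm, and the assumption that fields agree independently are all hypothetical choices made for illustration and are not taken from the article.

```python
import math

# Assumed notation (not from the abstract):
#   m = P(field agrees | record pair is a true match)
#   u = P(field agrees | record pair is not a match)

def agreement_weight(m, u):
    """Log2 likelihood ratio contributed when a field agrees."""
    return math.log2(m / u)

def disagreement_weight(m, u):
    """Log2 likelihood ratio contributed when a field disagrees."""
    return math.log2((1 - m) / (1 - u))

def match_weight(agreements, m_probs, u_probs):
    """Sum field weights for one record pair, assuming fields are independent."""
    total = 0.0
    for field, agrees in agreements.items():
        m, u = m_probs[field], u_probs[field]
        total += agreement_weight(m, u) if agrees else disagreement_weight(m, u)
    return total

def posterior_match_probability(weight, prior):
    """Bayes' theorem in odds form: posterior odds = prior odds * likelihood ratio."""
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * 2 ** weight
    return posterior_odds / (1 + posterior_odds)

# Hypothetical values for a single candidate record pair.
m_probs = {"sex": 0.95, "birth_year": 0.90, "postcode": 0.80}
u_probs = {"sex": 0.50, "birth_year": 0.02, "postcode": 0.01}
agreements = {"sex": True, "birth_year": True, "postcode": False}

weight = match_weight(agreements, m_probs, u_probs)
probability = posterior_match_probability(weight, prior=0.001)
```

A pair that agrees on discriminating fields (low u) gains a large positive weight, while disagreement on a reliable field (high m) subtracts from it; the prior reflects how rare true matches are among all candidate pairs.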