期刊
INTERNATIONAL STATISTICAL REVIEW
卷 79, 期 3, 页码 362-384出版社
WILEY-BLACKWELL
DOI: 10.1111/j.1751-5823.2011.00153.x
关键词
Economic census; data confidentiality; synthetic data; disclosure limitation
资金
- NSF [ITR-0427889]
- Direct For Social, Behav & Economic Scie
- Divn Of Social and Economic Sciences [0922005] Funding Source: National Science Foundation
- Direct For Social, Behav & Economic Scie
- Divn Of Social and Economic Sciences [1042181] Funding Source: National Science Foundation
In most countries, national statistical agencies do not release establishment-level business microdata, because doing so represents too large a risk to establishments' confidentiality. One approach with the potential for overcoming these risks is to release synthetic data; that is, the released establishment data are simulated fromstatistical models designed to mimic the distributions of the underlying real microdata. In this article, we describe an application of this strategy to create a public use file for the Longitudinal Business Database, an annual economic census of establishments in the United States comprising more than 20 million records dating back to 1976. The U. S. Bureau of the Census and the Internal Revenue Service recently approved the release of these synthetic microdata for public use, making the synthetic Longitudinal Business Database the first-ever business microdata set publicly released in the United States. We describe how we created the synthetic data, evaluated analytical validity, and assessed disclosure risk.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据