4.2 Article

Exact and approximate methods for data directed microaggregation in one or more dimensions

出版社

WORLD SCIENTIFIC PUBL CO PTE LTD
DOI: 10.1142/S0218488502001582

关键词

statistical disclosure control; statistical confidentiality; microdata release; microaggregation

向作者/读者索取更多资源

Microaggregation is a technique for the protection of the confidentiality of respondents in rnicrodata releases. It is used for economic data where respondent identifiability is high. Microaggregation releases the averages of small groups in which no single respondent is dominant. It was developed for univariate data. The data was sorted and the averages of adjacent fixed size groups were reported. The groups can be allowed to have varying sizes so that no group will include a large gap in the sorted data. The groups become more homogeneous when their boundaries are sensitive to the distribution of the data. This is like clustering but with the number of clusters chosen to be as large as possible subject to homogeneous clusters and a minimum cluster size. Approximate methods based on comparisons are developed. Exact methods based on linear optimization are also developed. For bivariate, or higher dimensional, data the notion of adjacency is defined even though sorting is no longer well defined. The constraints for minimum cluster size are also more elaborate and not so easily solved. We may also use only a triangulation to limit the number of adjacencies to be considered in the algorithms. Hybrids of the approximate and exact methods combine the strengths of each strategy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据