4.3 Article

Proxy expenditure weights for Consumer Price Index: Audit sampling inference for big-data statistics

出版社

WILEY
DOI: 10.1111/rssa.12632

关键词

evaluation coverage; privacy protection; proxy source effect; survey burden and cost

向作者/读者索取更多资源

This study discusses the use of purchase data from retail chains as proxy measures of private household expenditure, highlighting the bias and variance issues associated with these proxy measures. An audit sampling inference approach is proposed to investigate the potential of replacing costly and burdensome surveys with non-survey big-data sources. However, in some cases, meaningful results may not be obtained using this method.
Purchase data from retail chains can provide proxy measures of private household expenditure on items that are the most troublesome to collect in the traditional expenditure survey. Due to the inevitable coverage and selection errors, bias must exist in these proxy measures. Moreover, given the sheer amount of data, the bias completely dominates the variance. To investigate the potential of replacing costly and burdensome surveys by non-survey big-data sources, we propose an audit sampling inference approach, which does not require linking the audit sample and the big-data source at the individual level. It turns out that one is unable to reject a null hypothesis of unbiased big-data estimation at the chosen size, because the audit sampling variance is too large compared to the bias of the big-data estimate. For the same reason, audit sampling fails to yield a meaningful mean squared error estimate. We propose a novel accuracy measure that is generally applicable in such situations. This can provide a necessary part of the statistical argument for the uptake of non-survey big-data sources, in replacement of traditional survey sampling. An application to disaggregated food price indices is used to demonstrate the proposed approach.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据