Article

Sampling Techniques for Big Data Analysis

Journal

INTERNATIONAL STATISTICAL REVIEW
Volume 87, Pages S177-S191

Publisher

WILEY
DOI: 10.1111/insr.12290

Keywords

Data integration; inverse sampling; non-probability sample; selection bias

Funding

  1. US National Science Foundation

Abstract

In analysing big data for finite population inference, it is critical to adjust for the selection bias in the big data. In this paper, we propose two methods of reducing the selection bias associated with the big data sample. The first method uses a version of inverse sampling by incorporating auxiliary information from external sources, and the second one borrows the idea of data integration by combining the big data sample with an independent probability sample. Two simulation studies show that the proposed methods are unbiased and have better coverage rates than their alternatives. In addition, the proposed methods are easy to implement in practice.
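The inverse-sampling idea mentioned in the abstract can be illustrated with a small simulation. What follows is a minimal sketch under assumptions that are not taken from the paper: a toy population with one binary auxiliary variable `x`, population counts of `x` assumed known from an external source, and a weighted second-phase draw with replacement. The function name `simulate` and all parameter values are hypothetical, and this is not the authors' estimator.

```python
import random


def simulate(seed=0, pop_size=50_000, subsample_size=5_000):
    """Return (true_mean, naive_mean, inverse_mean) for a toy population."""
    rng = random.Random(seed)

    # Hypothetical finite population: binary auxiliary variable x and an
    # outcome y whose mean depends on x.
    population = []
    for _ in range(pop_size):
        x = 1 if rng.random() < 0.3 else 0
        y = 2.0 * x + rng.gauss(0.0, 1.0)
        population.append((x, y))
    true_mean = sum(y for _, y in population) / pop_size

    # Big data sample with selection bias: units with x == 1 are far more
    # likely to be included than units with x == 0.
    big = [(x, y) for x, y in population
           if rng.random() < (0.6 if x == 1 else 0.1)]
    naive_mean = sum(y for _, y in big) / len(big)

    # Auxiliary information, assumed known from an external source:
    # the population count in each x group.
    pop_counts = {g: sum(1 for x, _ in population if x == g) for g in (0, 1)}
    big_counts = {g: sum(1 for x, _ in big if x == g) for g in (0, 1)}

    # Weight each big-data unit by N_g / n_g, so that groups that are
    # under-represented in the big data are drawn more often.
    weights = [pop_counts[x] / big_counts[x] for x, _ in big]

    # Second-phase draw (with replacement) proportional to the weights.
    # The resulting subsample has roughly the population mix of x.
    subsample = rng.choices(big, weights=weights, k=subsample_size)
    inverse_mean = sum(y for _, y in subsample) / subsample_size

    return true_mean, naive_mean, inverse_mean


if __name__ == "__main__":
    true_mean, naive_mean, inverse_mean = simulate()
    print(f"true mean:             {true_mean:.3f}")
    print(f"naive big-data mean:   {naive_mean:.3f}")
    print(f"inverse-sampling mean: {inverse_mean:.3f}")
```

In this toy setting the naive big-data mean overstates the population mean because units with `x == 1` are over-represented, while the reweighted subsample mean lands close to the true value. The correction works here only because selection depends on `x` alone and the population counts of `x` are known; the sketch does not cover the second method, which combines the big data sample with an independent probability sample.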
