4.4 Article

A STATISTICAL ANALYSIS OF NOISY CROWDSOURCED WEATHER DATA

Journal

ANNALS OF APPLIED STATISTICS
Volume 14, Issue 1, Pages 116-142

Publisher

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/19-AOAS1290

Keywords

Veracity score; geostatistics; robust kriging; hyperlocal spatial prediction

Funding

  1. Laboratory for Analytic Sciences (LAS)
  2. NSF Grant [DMS-1613192]

Ask authors/readers for more resources

Spatial prediction of weather elements like temperature, precipitation, and barometric pressure are generally based on satellite imagery or data collected at ground stations. None of these data provide information at a more granular or hyperlocal resolution. On the other hand, crowdsourced weather data, which are captured by sensors installed on mobile devices and gathered by weather-related mobile apps like Weather Signal and AccuWeather, can serve as potential data sources for analyzing environmental processes at a hyperlocal resolution. However, due to the low quality of the sensors and the nonlaboratory environment, the quality of the observations in crowdsourced data is compromised. This paper describes methods to improve hyperlocal spatial prediction using this varying-quality, noisy crowdsourced information. We introduce a reliability metric, namely Veracity Score (VS), to assess the quality of the crowdsourced observations using a coarser, but high-quality, reference data. A VS-based methodology to analyze noisy spatial data is proposed and evaluated through extensive simulations. The merits of the proposed approach are illustrated through case studies analyzing crowdsourced daily average ambient temperature readings for one day in the contiguous United States.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available