4.7 Review

Big Geospatial Data and Data-Driven Methods for Urban Dengue Risk Forecasting: A Review

Journal

REMOTE SENSING
Volume 14, Issue 19, Pages -

Publisher

MDPI
DOI: 10.3390/rs14195052

Keywords

dengue; risk forecasting; big geospatial data; data-driven models; review

Funding

  1. Key Research Program of Frontier Sciences of the Chinese Academy of Sciences (CAS) [QYZDBSSW-DQC005]
  2. CAS [XDA19040301]
  3. Institute of Geographic Sciences and Natural Resources Research of the CAS [E0V00110YZ]

Ask authors/readers for more resources

With advancements in big geospatial data and artificial intelligence, multi-source data and diverse data-driven methods are commonly used in dengue risk prediction. The review found that the main predictors in dengue risk prediction include climate conditions, historical cases, vegetation indices, and human mobility. These predictors are derived from various sources such as meteorological datasets, public websites, and health department data. The spatial scales used range from city-level to neighborhood-level, with a focus on city-level analysis. The most popular temporal scales are monthly and weekly. Data-driven methods are categorized into single models, ensemble learning, and hybrid learning, with a dominance of regression-based approaches. Model evaluation focuses on comparing observed and predicted values, determining outbreak occurrences, and assessing model uncertainty. Future dengue risk forecasting will prioritize big geospatial data, data cloud computing, and deep learning models.
With advancements in big geospatial data and artificial intelligence, multi-source data and diverse data-driven methods have become common in dengue risk prediction. Understanding the current state of data and models in dengue risk prediction enables the implementation of efficient and accurate prediction in the future. Focusing on predictors, data sources, spatial and temporal scales, data-driven methods, and model evaluation, we performed a literature review based on 53 journal and conference papers published from 2018 to the present and concluded the following. (1) The predominant predictors include local climate conditions, historical dengue cases, vegetation indices, human mobility, population, internet search indices, social media indices, landscape, time index, and extreme weather events. (2) They are mainly derived from the official meteorological agency satellite-based datasets, public websites, department of health services and national electronic diseases surveillance systems, official statistics, and public transport datasets. (3) Country-level, province/state-level, city-level, district-level, and neighborhood-level are used as spatial scales, and the city-level scale received the most attention. The temporal scales include yearly, monthly, weekly, and daily, and both monthly and weekly are the most popular options. (4) Most studies define dengue risk forecasting as a regression task, and a few studies define it as a classification task. Data-driven methods can be categorized into single models, ensemble learning, and hybrid learning, with single models being further subdivided into time series, machine learning, and deep learning models. (5) Model evaluation concentrates primarily on the quantification of the difference/correlation between time-series observations and predicted values, the ability of models to determine whether a dengue outbreak occurs or not, and model uncertainty. Finally, we highlighted the importance of big geospatial data, data cloud computing, and other deep learning models in future dengue risk forecasting.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available