4.6 Article

Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe

期刊

BMC PUBLIC HEALTH
卷 21, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12889-020-10106-8

关键词

Syndromic surveillance; Measles; Linear regression; Forecasting; Programming languages; Computational science

资金

  1. University of Alcala de Henares

向作者/读者索取更多资源

New forms of syndromic surveillance using Internet data have been proposed in recent years to assist in the early prediction of epidemics. This study examines forecasting measles using official data on measles and Google Trends data, showing that measles can be estimated and predicted accurately through Google Trends. The results indicate a strong relationship between the predicted and actual measles cases, with a mean standard error of 45.2 (12.19%) for the combined results.
Background In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outbreaks before these are observed in population and, therefore, they can be used as a complement to other methods. In this research, our aim is to examine a highly infectious disease, measles, as there is no extensive literature on forecasting measles using Internet data, Methods This research has been conducted with official data on measles for 5 years (2013-2018) from the competent authority of the European Union (European Center of Disease and Prevention - ECDC) and data obtained from Google Trends by using scripts coded in Python. We compared regression models forecasting the development of measles in the five countries. Results Results show that measles can be estimated and predicted through Google Trends in terms of time, volume and the overall spread. The combined results reveal a strong relationship of measles cases with the predicted cases (correlation coefficient R= 0.779 in two-tailed significance p< 0.01). The mean standard error was relatively low 45.2 (12.19%) for the combined results. However, major differences and deviations were observed for countries with a relatively low impact of measles, such as the United Kingdom and Spain. For these countries, alternative models were tested in an attempt to improve the results. Conclusions The estimation of measles cases from Google Trends produces acceptable results and can help predict outbreaks in a robust and sound manner, at least 2 months in advance. Python scripts can be used individually or within the framework of an integrated Internet surveillance system for tracking epidemics as the one addressed here.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据