4.3 Article

Quest for Optimal Regression Models in SARS-CoV-2 Wastewater Based Epidemiology

出版社

MDPI
DOI: 10.3390/ijerph182010778

关键词

regression; SARS-CoV-2; wastewater-based epidemiology; incidence; multivariate model; Taylor diagram

资金

  1. Austrian Federal Ministry of Education, Science and Research
  2. Austrian Federal Ministry of Agriculture, Regions and Tourism
  3. City of Vienna
  4. Federal State of Carinthia
  5. Federal State of Salzburg
  6. Federal State of Vorarlberg
  7. Austrian Association of Cities and Towns
  8. University Innsbruck

向作者/读者索取更多资源

Wastewater-based epidemiology is a valuable source of information for pandemic management, as shown in this study comparing SARS-CoV-2 signal derived from wastewater sampling with COVID-19 incidence values from individual testing programs. The study highlights that data pre-processing and multivariate model formulation are more crucial than the structure of the model for predicting viral incidence based on wastewater data.
Wastewater-based epidemiology is a recognised source of information for pandemic management. In this study, we investigated the correlation between a SARS-CoV-2 signal derived from wastewater sampling and COVID-19 incidence values monitored by means of individual testing programs. The dataset used in the study is composed of timelines (duration approx. five months) of both signals at four wastewater treatment plants across Austria, two of which drain large communities and the other two drain smaller communities. Eight regression models were investigated to predict the viral incidence under varying data inputs and pre-processing methods. It was found that population-based normalisation and smoothing as a pre-processing of the viral load data significantly influence the fitness of the regression models. Moreover, the time latency lag between the wastewater data and the incidence derived from the testing program was found to vary between 2 and 7 days depending on the time period and site. It was found to be necessary to take such a time lag into account by means of multivariate modelling to boost the performance of the regression. Comparing the models, no outstanding one could be identified as all investigated models are revealing a sufficient correlation for the task. The pre-processing of data and a multivariate model formulation is more important than the model structure.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据