4.6 Review

Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict

Journal

JOURNAL OF THE ROYAL SOCIETY INTERFACE
Volume 18, Issue 179, Pages -

Publisher

ROYAL SOC
DOI: 10.1098/rsif.2021.0104

Keywords

disease mapping; explanatory modelling; geostatistics; predictive modelling; prevalence; spatial correlation

Funding

  1. Springboard of the Academy of Medical Sciences [SBF004/1009]
  2. Royal Society Newton International Fellowship [NIF/R1/201418]
  3. DELTAS Africa Initiative [DEL-15-003]
  4. New Partnership for Africa's Development Planning and Coordinating Agency
  5. Wellcome Trust [107769, 211208, 103602, 212176, 203077]
  6. UK government
  7. UK's Department for International Development under a project entitled Strengthening the Use of Data for Malaria Decision Making in Africa (DFID) [203155]

Ask authors/readers for more resources

This paper provides statistical guidance on model-based geostatistical methods for disease prevalence mapping, illustrating the different stages of analysis through a case study on malaria mapping in Tanzania. It distinguishes between predictive and explanatory modelling, proposes a method for detecting over-fitting, and introduces the concept of domain effects for variable selection and model validation. The statistical ideas presented are widely applicable to regression models for epidemiological outcomes, with particular relevance to geostatistical models.
This paper provides statistical guidance on the development and application of model-based geostatistical methods for disease prevalence mapping. We illustrate the different stages of the analysis, from exploratory analysis to spatial prediction of prevalence, through a case study on malaria mapping in Tanzania. Throughout the paper, we distinguish between predictive modelling, whose main focus is on maximizing the predictive accuracy of the model, and explanatory modelling, where greater emphasis is placed on understanding the relationships between the health outcome and risk factors. We demonstrate that these two paradigms can result in different modelling choices. We also propose a simple approach for detecting over-fitting based on inspection of the correlation matrix of the estimators of the regression coefficients. To enhance the interpretability of geostatistical models, we introduce the concept of domain effects in order to assist variable selection and model validation. The statistical ideas and principles illustrated here in the specific context of disease prevalence mapping are more widely applicable to any regression model for the analysis of epidemiological outcomes but are particularly relevant to geostatistical models, for which the separation between fixed and random effects can be ambiguous.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available