4.6 Article

Improved prediction accuracy for disease risk mapping using Gaussian process stacked generalization

Journal

JOURNAL OF THE ROYAL SOCIETY INTERFACE
Volume 14, Issue 134, Pages -

Publisher

ROYAL SOC
DOI: 10.1098/rsif.2017.0520

Keywords

Gaussian process; malaria; disease mapping; stacked generalization

Funding

  1. MRC Outbreak Centre
  2. Bill and Melinda Gates Foundation [OPP1152978, OPP1110495, OPP1068048, OPP1106023]
  3. National Institutes of Health/National Institute of Allergy and Infectious Diseases [U19AI089674]
  4. Research and Policy for Infectious Disease Dynamics (RAPIDD) programme of the Science and Technology Directorate, Department of Homeland Security
  5. Fogarty International Center, National Institutes of Health
  6. UK Medical Research Council (MRC) [K00669X]
  7. UK Department for International Development (DFID) under the MRC/DFID Concordat agreement
  8. European Union
  9. MRC [MR/K00669X/1] Funding Source: UKRI
  10. Medical Research Council [MR/K010174/1B, MR/K00669X/1] Funding Source: researchfish
  11. Bill and Melinda Gates Foundation [OPP1152978, OPP1110495] Funding Source: Bill and Melinda Gates Foundation

Ask authors/readers for more resources

Maps of infectious disease-charting spatial variations in the force of infection, degree of endemicity and the burden on human health-provide an essential evidence base to support planning towards global health targets. Contemporary disease mapping efforts have embraced statistical modelling approaches to properly acknowledge uncertainties in both the available measurements and their spatial interpolation. The most common such approach is Gaussian process regression, a mathematical framework composed of two components: a mean function harnessing the predictive power of multiple independent variables, and a covariance function yielding spatio-temporal shrinkage against residual variation from the mean. Though many techniques have been developed to improve the flexibility and fitting of the covariance function, models for the mean function have typically been restricted to simple linear terms. For infectious diseases, known to be driven by complex interactions between environmental and socio-economic factors, improved modelling of the mean function can greatly boost predictive power. Here, we present an ensemble approach based on stacked generalization that allows for multiple nonlinear algorithmic mean functions to be jointly embedded within the Gaussian process framework. We apply this method to mapping Plasmodium falciparum prevalence data in sub-Saharan Africa and show that the generalized ensemble approach markedly outperforms any individual method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available