Article

Machine and deep learning for modelling heat-health relationships

Journal

SCIENCE OF THE TOTAL ENVIRONMENT
Volume 892

Publisher

ELSEVIER
DOI: 10.1016/j.scitotenv.2023.164660

Keywords

Mortality; Temperature; Humidity; Air pollution; Machine learning; Deep learning


Extreme heat events pose a significant threat to population health that is amplified by climate change. Traditionally, statistical models have been used to model heat-health relationships, but they do not consider potential interactions between temperature-related and air pollution predictors. Artificial intelligence (AI) methods, which have gained popularity for health applications in recent years, can account for these complex and non-linear interactions, but have been underutilized in modelling heat-related health impacts. In this paper, six machine and deep learning models were considered to model the heat-mortality relationship in Montreal (Canada) and compared to three statistical models commonly used in the field. Decision Tree (DT), Random Forest (RF), Gradient Boosting Machine (GBM), Single- and Multi-Layer Perceptrons (SLP and MLP), Long Short-Term Memory (LSTM), Generalized Linear and Additive Models (GLM and GAM), and Distributed Lag Non-Linear Model (DLNM) were employed. Heat exposure was characterized by air temperature, relative humidity and wind speed, while air pollution was also included in the models using five pollutants. The results confirmed that air temperature at lags of up to 3 days was the most important variable for the heat-mortality relationship in all models. NO2 concentration and relative humidity (at lags 1 to 3 days) were also particularly important. Ensemble tree-based methods (GBM and RF) outperformed other approaches to model daily mortality during summer months based on three performance criteria. However, a partial validation during two recent major heatwaves highlighted that non-linear statistical models (GAM and DLNM) and the simpler decision tree may more closely reproduce the spike in mortality observed during such events. Hence, both machine learning and statistical models are relevant for modelling heat-health relationships, depending on the end user's goal. Such an extensive comparative analysis should be extended to other health outcomes and regions.
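
The lagged predictors mentioned in the abstract (for example, air temperature at lags of up to 3 days) can be illustrated with a minimal sketch. This is not the authors' code: the function name `lagged_features` and the temperature values are hypothetical and chosen only for illustration. It shows one common way to turn a daily series into rows holding the same-day value followed by the values from the previous days, which any of the listed models could then take as inputs.

```python
from typing import List, Sequence


def lagged_features(series: Sequence[float], max_lag: int) -> List[List[float]]:
    """Build one row per day t >= max_lag: [x_t, x_(t-1), ..., x_(t-max_lag)].

    Days without a full lag history (the first max_lag days) are skipped.
    """
    if max_lag < 0:
        raise ValueError("max_lag must be non-negative")
    rows = []
    for t in range(max_lag, len(series)):
        # Lag 0 is the same-day value; lag k is the value k days earlier.
        rows.append([series[t - lag] for lag in range(max_lag + 1)])
    return rows


# Hypothetical daily mean air temperatures (degrees Celsius), illustration only.
temperature = [24.0, 27.5, 31.0, 33.5, 29.0, 25.5]

rows = lagged_features(temperature, max_lag=3)
for day, row in zip(range(3, len(temperature)), rows):
    print(day, row)
```

Each row could be paired with that day's mortality count to form a training example. Other predictors such as relative humidity or NO2 concentration would be lagged the same way and appended to the row.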
