4.7 Article

A mixed spatial prediction model in estimating spatiotemporal variations in benzene concentrations in Taiwan

Journal

CHEMOSPHERE
Volume 301, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.chemosphere.2022.134758

Keywords

Benzene; Land-use regression (LUR); Machine learning; Mixed spatial prediction; Benzene; Land-use regression (LUR); Machine learning; Mixed spatial prediction

Funding

  1. Ministry of Science and Technology, Taiwan [MOST 108-2621-M-006-017, MOST 108-2638-B-006-001-MY2]
  2. National Health Research Institute, Taiwan [NHRI-109A1-EMCO-02202312]

Ask authors/readers for more resources

This study is the first to predict spatial-temporal variations in benzene concentrations for the entirety of Taiwan using a mixed spatial prediction model and multiple machine learning algorithms. The results show the value of the proposed ensemble-based model for estimating spatiotemporal variation in benzene exposure.
It is well known benzene negatively impacts human health. This study is the first to predict spatial-temporal variations in benzene concentrations for the entirety of Taiwan by using a mixed spatial prediction model integrating multiple machine learning algorithms and predictor variables selected by Land-use Regression (LUR). Monthly benzene concentrations from 2003 to 2019 were utilized for model development, and monthly benzene concentration data from 2020, as well as mobile monitoring vehicle data from 2009 to 2019, served as external data for verifying model reliability. Benzene concentrations were estimated by running six LUR-based machine learning algorithms; these algorithms, which include random forest (RF), deep neural network (DNN), gradient boosting (GBoost), light gradient boosting (LightGBM), CatBoost, extreme gradient boosting (XGBoost), and ensemble algorithms (a combination of the three best performing models), can capture how nonlinear observations and predictions are related. The results indicated conventional LUR captured 79% of the variability in benzene concentrations. Notably, the LUR with ensemble algorithm (GBoost, CatBoost, and XGBoost) surpassed all other integrated methods, increasing the explanatory power to 92%. This study establishes the value of the proposed ensemble-based model for estimating spatiotemporal variation in benzene exposure.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available