4.7 Article

Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations

Journal

JOURNAL OF ENVIRONMENTAL MANAGEMENT
Volume 306, Issue -, Pages -

Publisher

ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
DOI: 10.1016/j.jenvman.2022.114434

Keywords

Haze; Hierarchical clustering; Persistent homology; Time delay embedding; Topological data analysis

Funding

  1. Ministry of Education Malaysia [FRGS/1/2019/STG06/UKM/01/3]

Ask authors/readers for more resources

This study proposes a hybridization framework of HACA technique that evaluates the spatial patterns of areas affected by haze episodes by considering the topological similarity between stations. Results show that the inclusion of topological features improves the accuracy of air pollution behavior similarity assessment.
Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available