4.5 Article

Strategies for EELS Data Analysis. Introducing UMAP and HDBSCAN for Dimensionality Reduction and Clustering

Journal

MICROSCOPY AND MICROANALYSIS
Volume 28, Issue 1, Pages 109-122

Publisher

OXFORD UNIV PRESS
DOI: 10.1017/S1431927621013696

Keywords

clustering; dimensionality reduction; EELS; HDBSCAN; UMAP

Funding

  1. Spanish Ministry of Science and Innovation (MICINN) [PID2019-106165GB-C21]
  2. Spanish Research Network [RED2018-102609-T]
  3. Spanish ministry of environment (MITECO) [PID2019-107106RB-C3119S01452-006]
  4. Generalitat de Catalunya [MIND-2017 SGR 776]
  5. FI-AGAUR Research Fellowship Program [2018FI_B_00360]

Ask authors/readers for more resources

In this study, two new clustering analysis methods and dimensionality reduction methods (HDBSCAN and UMAP) were proposed for the segmentation of core-loss electron energy loss spectroscopy (EELS) spectrum images. The performances of HDBSCAN and UMAP were systematically compared to other methods using synthetic and experimental datasets, and better results were obtained. The results also showed that the combination of different algorithms can provide a more comprehensive understanding of the dataset studied.
Hierarchical density-based spatial clustering of applications with noise (HDBSCAN) and uniform manifold approximation and projection (UMAP), two new state-of-the-art algorithms for clustering analysis, and dimensionality reduction, respectively, are proposed for the segmentation of core-loss electron energy loss spectroscopy (EELS) spectrum images. The performances of UMAP and HDBSCAN are systematically compared to the other clustering analysis approaches used in EELS in the literature using a known synthetic dataset. Better results are found for these new approaches. Furthermore, UMAP and HDBSCAN are showcased in a real experimental dataset from a core-shell nanoparticle of iron and manganese oxides, as well as the triple combination non-negative matrix factorization-UMAP-HDBSCAN. The results obtained indicate how the complementary use of different combinations may be beneficial in a real-case scenario to attain a complete picture, as different algorithms highlight different aspects of the dataset studied.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available