Article

A spatial distance-based spatial clustering algorithm for sparse image data

Journal

ALEXANDRIA ENGINEERING JOURNAL
Volume 61, Issue 12, Pages 12609-12622

Publisher

ELSEVIER
DOI: 10.1016/j.aej.2022.06.045
ISSN: 1110-0168

Keywords

Semantic image segmentation; Sparse data; Image clustering; Space clustering; Reduction; Synthetic aperture radar (SAR) satellite images

Funding

  1. National Natural Science Foundation, Outstanding Youth Foundation of China [72002067]
  2. Humanities and Social Science Research Project of Ministry of Education of China [20YJCZH081]
  3. Scientific Research Project of Education Department of Hubei Province [D20212701, D20202701, B2020385]

Image classification aims to understand the attributes and features of each object in the scene by assigning them to predefined categories, and further explores the potential features and internal connections of the data. However, accurately classifying sparse data, especially small objects in images, is a challenge. This paper presents a spatial distance-based spatial clustering algorithm to address the issue of feature sparsity and imbalance in image data.
By allocating each object to one of the predefined categories, image classification captures the attributes and features of each object in the scene and further mines the potential features and internal connections of the data, supplying the structured data needed for subsequent application decision-making. One of the key challenges in image classification is how to accurately classify sparse data when the categories are imbalanced, i.e., how to identify small objects in images. Recognizing a person in satellite images is one such task. These objects are sparse either globally or in each recognizable local segment, so they are often overlooked by the classifier or removed as noise. In deep learning, feature sparsity means the samples contain too much useless information, which suppresses the generalization and accuracy of the model. To solve the problem, this paper presents a spatial distance-based spatial clustering algorithm for sparse image data (SDBSCA-SID). Firstly, the imaging range of the image sensor constitutes a two-dimensional (2D) constraint space. Under this constraint, spatial clustering is carried out based on the features of each sample to aggregate dense data into primary categories, and sparse data and noise into secondary categories. With reference to the 2D constrained space, multiple spatial classification surfaces are constructed so that the sparse data gather on the two sides of these surfaces as much as possible. If the error is minimized, the sparse data are assigned to these classification surfaces. To shorten the convergence time of the clustering algorithm on imbalanced data, the original sample set is cut into slices, which are assigned to several calculation units for separate clustering. Next, the same-class clusters are merged through reduction. Finally, the obtained class labels are compared with the preset class labels, completing the semantic segmentation of images. The stability and accuracy of the algorithm were demonstrated through tests on image samples.
(c) 2022 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Alexandria University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
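
The slice, cluster and merge steps described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' SDBSCA-SID implementation: a simple distance-threshold connected-components clustering stands in for the paper's spatial clustering, clusters are treated as "same-class" when their centroids are close, and the function names (`cluster_slice`, `merge_cluster_lists`, `sliced_clustering`) and parameters (`eps`, `merge_dist`, `min_size`) are all hypothetical choices made for illustration.

```python
from functools import reduce

import numpy as np


def cluster_slice(points, eps):
    """Cluster one slice: points chained by distances <= eps share a cluster."""
    n = len(points)
    dist = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    adjacent = dist <= eps
    labels = np.full(n, -1)
    current = 0
    for seed in range(n):
        if labels[seed] != -1:
            continue
        labels[seed] = current
        stack = [seed]
        while stack:  # flood-fill the connected component of this seed
            i = stack.pop()
            for j in np.flatnonzero(adjacent[i] & (labels == -1)):
                labels[j] = current
                stack.append(j)
        current += 1
    return [points[labels == k] for k in range(current)]


def merge_cluster_lists(left, right, merge_dist):
    """Reduction step: fold clusters of `right` into `left` when centroids are close.

    Assumption: centroid proximity is used as the "same-class" test.
    """
    merged = list(left)
    for cluster in right:
        centre = cluster.mean(axis=0)
        for idx, existing in enumerate(merged):
            if np.linalg.norm(existing.mean(axis=0) - centre) <= merge_dist:
                merged[idx] = np.vstack([existing, cluster])
                break
        else:
            merged.append(cluster)  # no close cluster found: keep it separate
    return merged


def sliced_clustering(points, n_slices, eps, merge_dist, min_size):
    """Slice the sample set, cluster each slice separately, then merge by reduction."""
    slices = [points[i::n_slices] for i in range(n_slices)]
    per_slice = [cluster_slice(s, eps) for s in slices]  # could run on separate workers
    merged = reduce(lambda a, b: merge_cluster_lists(a, b, merge_dist), per_slice)
    primary = [c for c in merged if len(c) >= min_size]   # dense data
    secondary = [c for c in merged if len(c) < min_size]  # sparse data and noise
    return primary, secondary


# Illustrative data: two dense blobs plus two isolated points.
rng = np.random.default_rng(0)
demo_points = np.vstack([
    rng.normal([0.0, 0.0], 0.05, size=(60, 2)),
    rng.normal([5.0, 5.0], 0.05, size=(60, 2)),
    np.array([[10.0, 0.0], [0.0, 10.0]]),
])
demo_primary, demo_secondary = sliced_clustering(
    demo_points, n_slices=3, eps=1.0, merge_dist=1.0, min_size=10
)
print(len(demo_primary), "primary clusters,", len(demo_secondary), "secondary clusters")
```

Each slice is clustered independently, so the per-slice calls could be distributed across calculation units. The merge is written as a pairwise fold, which is why `functools.reduce` fits. The size threshold separating primary from secondary categories is an assumption of this sketch, not a value taken from the paper.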
