4.6 Article

Scalable clustering for EO data using efficient raster representation

期刊

MULTIMEDIA TOOLS AND APPLICATIONS
卷 82, 期 8, 页码 12303-12319

出版社

SPRINGER
DOI: 10.1007/s11042-022-13726-x

关键词

Compact Data Structure; k(2)-raster; Earth observation data; Scalable clustering

向作者/读者索取更多资源

Earth Observation (EO) data is a valuable source of information that can be used in various applications. To uncover hidden information, unsupervised learning techniques such as clustering are commonly used. However, traditional clustering algorithms have limitations when processing large EO data due to memory constraints. In this study, we propose a compressed data structure called k(2)-raster for clustering raster data, which improves processing efficiency.
Earth Observation (EO) data is a source of a wide range of information, in vegetation, oceanography, land use, land cover and many more applications. To uncover the hidden information in the data, unsupervised learning techniques like clustering is used popularly. With technological advancements, the amount of data received through satellites rises exponentially, possessing the properties of Big Data. Traditional implementations of clustering algorithms have processing limitations based on the memory capability of the system. In general, applying any clustering algorithm directly to the large EO raster data requires a considerably large amount of time, due to the spatio-temporal nature of the data. The data generated by remote sensors have significant redundant values, so we have made an attempt to use compressed raster data for clustering without decompressing it. In this work, we present the compact data structure, known as k(2)-raster, based technique for clustering raster data. k(2)-raster preserves the spatial context in the image and provides time efficient and lossless compression. We have applied this technique to OCM2-NDVI, in GeoTIFF format, single and stacked image to develop a compressed dataset for efficient representation. As partition based clustering algorithms are widely used in clustering EO data because of their simplicity and time efficiency, we have examined the results on k-means and mini-batch k-means. We have also assessed the performance of our algorithm on model based clustering algorithms. It has been demonstrated that, for datasets with larger numbers of raster values, the developed compact data structure based approach works very well in terms of compact representation of data as well as, with both the partition based and model based clustering techniques making the clustering scalable.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据