4.6 Review

Scalable Clustering Algorithms for Big Data: A Review

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 80015-80027

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3084057

Keywords

Clustering algorithms; Big Data; Scalability; Partitioning algorithms; Data mining; Licenses; Classification algorithms; Clustering; unsupervised learning; traditional clustering; parallel clustering; stream clustering; high dimensional data; big data; large-scale

Ask authors/readers for more resources

In the era of big data, traditional clustering algorithms face high computational costs, making it challenging to accurately process massive amounts of data in crucial moments. Despite the development of different algorithms to facilitate clustering processes, there are still many difficulties when dealing with large data volumes.
Clustering algorithms have become one of the most critical research areas in multiple domains, especially data mining. However, with the massive growth of big data applications in the cloud world, these applications face many challenges and difficulties. Since Big Data refers to an enormous amount of data, most traditional clustering algorithms come with high computational costs. Hence, the research question is how to handle this volume of data and get accurate results at a critical time. Despite ongoing research work to develop different algorithms to facilitate complex clustering processes, there are still many difficulties that arise while dealing with a large volume of data. In this paper, we review the most relevant clustering algorithms in a categorized manner, provide a comparison of clustering methods for large-scale data and explain the overall challenges based on clustering type. The key idea of the paper is to highlight the main advantages and disadvantages of clustering algorithms for dealing with big data in a scalable approach behind the different other features.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available