k-Means Clustering Algorithm and Its Simulation Based on Distributed Computing Platform

COMPLEXITY (2021)

Journal: COMPLEXITY

Volume 2021, Issue -, Pages -

Publisher: WILEY-HINDAWI

DOI: 10.1155/2021/9446653

Automated Summary

This paper studies the parallel k-means algorithm in depth. The algorithm is sped up by parallelizing both the distance calculation and the clustering of data objects, and the results show efficient, stable service provision with good convergence.

Abstract

The explosive growth of data and the demands of mass storage have created problems for clustering research, including high computational complexity and insufficient computing power. A distributed computing platform uses load balancing to dynamically configure a large number of virtual computing resources, which breaks through the bottleneck of time and energy consumption and gives the platform a distinct advantage in massive data mining. This paper studies parallel k-means in depth. It first performs initialization by random sampling. It then parallelizes the distance calculation, relying on the independence between data objects so that cluster analysis can run in parallel. With MapReduce parallel processing, many nodes compute distances at the same time, which improves the efficiency of the algorithm. Finally, the clustering of data objects is parallelized. The results show that the method provides services efficiently and stably and has good convergence.
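The steps named in the abstract (random-sampling initialization, a parallel distance calculation over independent data objects, and a parallel cluster update) can be sketched as a MapReduce-style loop. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `map_chunk`, `reduce_partials`, and `parallel_kmeans` are hypothetical, a local thread pool stands in for the cluster nodes, and emitting per-partition partial sums from the map step is an assumed design choice.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def map_chunk(chunk, centers):
    """Map step (hypothetical name): assign each point in one partition to
    its nearest center and emit partial coordinate sums and counts."""
    partial = {}
    for p in chunk:
        idx = min(range(len(centers)), key=lambda i: sq_dist(p, centers[i]))
        s, n = partial.get(idx, ([0.0] * len(p), 0))
        partial[idx] = ([a + b for a, b in zip(s, p)], n + 1)
    return partial


def reduce_partials(partials, centers):
    """Reduce step (hypothetical name): merge the partial sums from all
    partitions and recompute each center as the mean of its points."""
    totals = {}
    for partial in partials:
        for idx, (s, n) in partial.items():
            ts, tn = totals.get(idx, ([0.0] * len(s), 0))
            totals[idx] = ([a + b for a, b in zip(ts, s)], tn + n)
    new_centers = []
    for i, old in enumerate(centers):
        if i in totals:
            s, n = totals[i]
            new_centers.append(tuple(x / n for x in s))
        else:
            new_centers.append(old)  # empty cluster: keep the old center
    return new_centers


def parallel_kmeans(points, k, n_workers=4, max_iter=100, tol=1e-9, seed=0):
    """Assumed driver loop: threads stand in for distributed worker nodes."""
    rng = random.Random(seed)
    centers = [tuple(p) for p in rng.sample(points, k)]  # random-sampling init
    chunks = [points[i::n_workers] for i in range(n_workers)]  # data partitions
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        for _ in range(max_iter):
            partials = list(pool.map(map_chunk, chunks, [centers] * n_workers))
            new_centers = reduce_partials(partials, centers)
            shift = max(sq_dist(a, b) for a, b in zip(centers, new_centers))
            centers = new_centers
            if shift <= tol:  # centers stopped moving
                break
    return centers


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (2, 1), (10, 10), (10, 12), (11, 10), (13, 11)]
    print(parallel_kmeans(data, k=2))
```

Because each point's nearest center depends only on that point and the current centers, the partitions can be processed in any order or on any node. Python threads do not give a real speedup for this CPU-bound work; on an actual platform the map step would run as tasks in a MapReduce framework, which is outside the scope of this sketch.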
