☆ 4.1 Article

Improvement of the Fast Clustering Algorithm Improved by K-Means in the Big Data

APPLIED MATHEMATICS AND NONLINEAR SCIENCES (2020)

Journal

APPLIED MATHEMATICS AND NONLINEAR SCIENCES

Volume 5, Issue 1, Pages 1-10

Publisher

WALTER DE GRUYTER GMBH

DOI: 10.2478/AMNS.2020.1.00001

Keywords

Big Data; Clustering; K-means; Feature space

Funding

Science and Technology Research Program of Chongqing Municipal Education Commission [KJ1709207]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Clustering as a fundamental unsupervised learning is considered an important method of data analysis, and K-means is demonstrably the most popular clustering algorithm. In this paper, we consider clustering on feature space to solve the low efficiency caused in the Big Data clustering by K-means. Different from the traditional methods, the algorithm guaranteed the consistency of the clustering accuracy before and after descending dimension, accelerated K-means when the clustering centeres and distance functions satisfy certain conditions, completely matched in the preprocessing step and clustering step, and improved the efficiency and accuracy. Experimental results have demonstrated the effectiveness of the proposed algorithm.

Improvement of the Fast Clustering Algorithm Improved by K-Means in the Big Data

Journal

APPLIED MATHEMATICS AND NONLINEAR SCIENCES

Publisher

WALTER DE GRUYTER GMBH

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Improvement of the Fast Clustering Algorithm Improved by K-Means in the Big Data

Journal

APPLIED MATHEMATICS AND NONLINEAR SCIENCES

Publisher

WALTER DE GRUYTER GMBH

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper