4.7 Article

Reduced Clustering Method Based on the Inversion Formula Density Estimation

Journal

MATHEMATICS
Volume 11, Issue 3, Pages -

Publisher

MDPI
DOI: 10.3390/math11030661

Keywords

nonparametric density estimation; unsupervised machine learning; clustering; inversion formula; dimensions reduction

Categories

Ask authors/readers for more resources

Unsupervised learning, especially clustering methods, has numerous applications with a focus on finding hidden relationships between individual observations. This paper presents an extension to the clustering method using modified inversion formula density estimation, overcoming previous limitations and yielding improved results in higher dimensions. Comparative data analysis using over 20 data sets confirms the effectiveness of the developed method improvement. The new extended method outperforms popular data clustering methods, even approaching the accuracy of the best models, and shows positive impact on clustering results.
Unsupervised learning is one type of machine learning with an exceptionally high number of applications in various fields. The most popular and best-known group of unsupervised machine learning methods is clustering methods. The main goal of clustering is to find hidden relationships between individual observations. There is great interest in different density estimation methods, especially when there are outliers in the data. Density estimation also can be applied to data clustering methods. This paper presents the extension to the clustering method based on the modified inversion formula density estimation to solve previous method limitations. This new method's extension works within higher dimensions (d > 15) cases, which was the limitation of the previous method. More than 20 data sets are used in comparative data analysis to prove the effectiveness of the developed method improvement. The results showed that the new method extension positively affects the data clustering results. The new reduced clustering method, based on the modified inversion formula density estimation, outperforms popular data clustering methods on test data sets. In cases when the accuracy is not the best, the data clustering accuracy is close to the best models' obtained accuracies. Lower dimensionality data were used to compare the standard clustering based on the inversion formula density estimation method with the extended method. The new modification method has better results than the standard method in all cases, which confirmed the hypothesis about the new method's positive impact on clustering results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available