☆ 4.6 Article

Performance Analysis and Architecture of a Clustering Hybrid Algorithm Called FA plus GA-DBSCAN Using Artificial Datasets

ENTROPY (2022)

期刊

ENTROPY

卷 24, 期 7, 页码 -

出版社

MDPI

DOI: 10.3390/e24070875

关键词

clustering; DBSCAN; factor analysis; genetic algorithm; pattern recognition; entropy

类别

Physics, Multidisciplinary

资金

Centro de Investigacion para el Desarrollo y la Innovacion CIDI from Universidad Pontificia Bolivariana Sede Central [636B-06/16-57]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The article presents a clustering hybrid algorithm for automatically grouping datasets into a two-dimensional space using DBSCAN, with parameters MinPts and Eps automated using nearest neighbor and a genetic algorithm. Factor Analysis was used for preprocessing high-dimensional datasets. The performance of the algorithm, FA+GA-DBSCAN, was evaluated using artificial datasets, showing lower error probability in clustering denser datasets.

Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is a widely used algorithm for exploratory clustering applications. Despite the DBSCAN algorithm being considered an unsupervised pattern recognition method, it has two parameters that must be tuned prior to the clustering process in order to reduce uncertainties, the minimum number of points in a clustering segmentation MinPts, and the radii around selected points from a specific dataset Eps. This article presents the performance of a clustering hybrid algorithm for automatically grouping datasets into a two-dimensional space using the well-known algorithm DBSCAN. Here, the function nearest neighbor and a genetic algorithm were used for the automation of parameters MinPts and Eps. Furthermore, the Factor Analysis (FA) method was defined for pre-processing through a dimensionality reduction of high-dimensional datasets with dimensions greater than two. Finally, the performance of the clustering algorithm called FA+GA-DBSCAN was evaluated using artificial datasets. In addition, the precision and Entropy of the clustering hybrid algorithm were measured, which showed there was less probability of error in clustering the most condensed datasets.

Performance Analysis and Architecture of a Clustering Hybrid Algorithm Called FA plus GA-DBSCAN Using Artificial Datasets

期刊

ENTROPY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Performance Analysis and Architecture of a Clustering Hybrid Algorithm Called FA plus GA-DBSCAN Using Artificial Datasets

期刊

ENTROPY

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文