4.6 Article

A three-way clustering method based on an improved DBSCAN algorithm

Journal

Publisher

ELSEVIER
DOI: 10.1016/j.physa.2019.122289

Keywords

Decision support; Three-way decision; Clustering; DBSCAN

Funding

  1. Aviation Science Fund of China [2016ZC53028]
  2. Seed Foundation of Innovation and Creation for Graduate Students in Northwestern Polytechnical University, China [ZZ2019030]
  3. National Natural Science Foundation of China [61872297]

Ask authors/readers for more resources

Clustering is a fundamental research field and plays an important role in data analysis. To better address the relationship between an element and a cluster, a Three-Way clustering method based on an Improved DBSCAN (3W-DBSCAN) algorithm is proposed in this paper. 3W-DBSCAN represents a cluster by a pair of nested sets called lower bound and upper bound respectively. The two bounds classify objects into three status: belong-to, not belong-to and ambiguity. Objects in lower bound certainly belong to the cluster. Objects in upper bound while not in the lower bound are ambiguous because they are in a boundary region and might belong to one or more clusters. Objects beyond the upper bound certainly do not belong to the same cluster. This clustering representation can well explain the clustering result and consist with human cognitive thinking. By improving similarity calculation, improved DBSCAN is presented to obtain initial clustering results, then three-way decision strategies are used to acquire the positive and boundary regions of a cluster. Three benchmarks Accuracy (Acc), F-measure (F-1), NMI and ten datasets including three synthetic datasets, three UCI datasets and four shape datasets are used in experiments to evaluate the effectiveness of 3W-DBSCAN. Experimental results suggest that 3W-DBSCAN has a good performance and is effective in clustering. (C) 2019 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available