4.7 Article

KR-DBSCAN: A density-based clustering algorithm based on reverse nearest neighbor and influence space

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 186, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.115763

Keywords

Density-based clustering; Cluster expansion; Reverse nearest neighborhood; Influence space; Core object

Funding

  1. National Natural Science Foundation of China (NSFC) [61876122]
  2. Scientific and Technological Innovation Team of Shanxi Province, P.R. China [201805D131007]

Ask authors/readers for more resources

KR-DBSCAN is a density-based clustering algorithm that distinguishes adjacent clusters with different densities effectively by defining a new cluster expansion condition using reverse nearest neighbors and their influence spaces. It substantially reduces computational load while identifying boundary objects and noise objects using k-nearest neighbors.
Density-based clustering is one of the most commonly used analysis methods in data mining and machine learning, with the advantage of locating non-ball-shaped clusters without specifying the number of clusters in advance. However, it has notable shortcomings, such as an inability to distinguish adjacent clusters of different densities. We propose a density-based clustering algorithm, KR-DBSCAN, which is based on the reverse nearest neighbor and influence space. The core objects are identified according to the reverse nearest neighborhood, and their influence spaces are determined by calculating the k-nearest neighborhood and reverse nearest neighborhood for each data object under the Euclidean distance metric. In particular, a new cluster expansion condition is defined using the reverse nearest neighborhood and its influence space, and when the core objects are within their influence spaces, they are added to the cluster by breadth-first traversal. As a result, adjacent clusters with different densities are effectively distinguished, and the computational load is substantially reduced. Boundary objects and noise objects are identified, also using k-nearest neighbors. KR-DBSCAN is experimentally validated on the UCI dataset and some synthetic datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available