4.7 Article

A multi-stage hierarchical clustering algorithm based on centroid of tree and cut edge constraint

期刊

INFORMATION SCIENCES
卷 557, 期 -, 页码 194-219

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.12.016

关键词

Hierarchical clustering; MST-based clustering; Geodesic distance; Centroid of tree; Inter-cluster distance; Cut edge constraint

向作者/读者索取更多资源

The paper introduces a novel hierarchical clustering algorithm based on minimum spanning tree (MST), which uses new ideas in computing inter-cluster distances and merging subclusters. Through three stages of operations, the proposed method shows good performance in clustering based on experimental results.
The minimum spanning tree clustering algorithm is known to be capable of detecting clusters with irregular boundaries. The paper presents a novel hierarchical clustering algorithm based on minimum spanning tree (MST), which tends to reduce the complexity of the merging process with guaranteed clustering performance. There are two core ideas in the proposed method: (1) The inter-cluster distance is calculated with the centroid of MST instead of the center of cluster. (2) The length of cut edge at the intersection of two adjacent clusters is taken as a merge condition. Based on this idea, we propose a three-stage MST-based hierarchical clustering algorithm (CTCEHC). In Stage 1, a preliminary partition is performed with the degrees of vertices in MST. In Stage 2, small subclusters are merged via the geodesic distance between the centroids of MST in two clusters and the cut edge constraint I. In Stage 3, the adjacent cluster pairs satisfying the cut edge constraint II are merged. The experimental results on the synthetic data sets and real data sets demonstrate a good performance of the proposed clustering method. (C) 2020 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据