4.2 Article

Applicability and Interpretability of Ward's Hierarchical Agglomerative Clustering With or Without Contiguity Constraints

期刊

JOURNAL OF CLASSIFICATION
卷 38, 期 2, 页码 363-389

出版社

SPRINGER
DOI: 10.1007/s00357-020-09377-y

关键词

Hierarchical agglomerative clustering; Ward's linkage; Contiguity constraint; Dendrogram; Monotonicity

资金

  1. INRAE/Inria doctoral program 2018
  2. SCALES project - CNRS (Mission Osez l'interdisciplinarit'e)

向作者/读者索取更多资源

This article discusses the various extensions of hierarchical agglomerative clustering (HAC) with Ward's linkage, applicability conditions, and different versions of graphical representation as dendrograms. It also highlights the distinction between the consistency property and absence of crossover within the dendrogram. The study shows that the constrained version of HAC can sometimes provide more relevant results despite optimizing the objective criterion on a reduced set of solutions.
Hierarchical agglomerative clustering (HAC) with Ward's linkage has been widely used since its introduction by Ward (Journal of the American Statistical Association, 58(301), 236-244,1963). This article reviews extensions of HAC to various input data and contiguity-constrained HAC, and provides applicability conditions. In addition, different versions of the graphical representation of the results as a dendrogram are also presented and their properties are clarified. We clarify and complete the results already available in an heterogeneous literature using a uniform background. In particular, this study reveals an important distinction between a consistency property of the dendrogram and the absence of crossover within it. Finally, a simulation study shows that the constrained version of HAC can sometimes provide more relevant results than its unconstrained version despite the fact that the constraint leads to optimize the objective criterion on a reduced set of solutions at each step. Overall, this article provides comprehensive recommendations, both for the use of HAC and constrained HAC depending on the input data, and for the representation of the results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据