期刊
INFORMATION SCIENCES
卷 527, 期 -, 页码 279-292出版社
ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.04.016
关键词
Data mining; Clustering; Community detection; Reciprocal nearest neighbor
资金
- National Natural Science Foundation of China [11975071, 61673085, 61433014]
- Science Strength Promotion Program of the UESTC [Y03111023901014006]
- Fundamental Research Funds for the Central Universities [ZYGX2016J196]
Clustering is a fundamental tool aiming at classifying data points into groups based on their pairwise distances or similarities. It has found successful applications in all natural and social sciences, including biology, physics, economics, chemistry, astronomy, psychology, and so on. Among various types of algorithms, hierarchical clustering is of particular advantages as it can provide results under different resolutions without the knowledge of a predetermined number of clusters. At the same time, it is usually time-consuming or inaccurate. In this paper, we propose a novel hierarchical clustering algorithm on the basis of a simple hypothesis that two reciprocal nearest data points should be grouped in one cluster. Extensive tests on data sets across multiple domains show that our method is much faster and more accurate than the state-of-the-art benchmarks. We further extend our method to deal with the community detection problem in real networks, achieving remarkably better results than the well-known Girvan-Newman algorithm. (C) 2020 Elsevier Inc. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据