4.7 Article

Evaluation of Clustering Algorithms on HPC Platforms

期刊

MATHEMATICS
卷 9, 期 17, 页码 -

出版社

MDPI
DOI: 10.3390/math9172156

关键词

clustering algorithms; performance evaluation; GPU computing; energy-efficiency; vector architectures

资金

  1. Spanish Ministry of Science and Innovation [RYC2018-025580-I]
  2. Spanish Agencia Estatal de Investigacion [PID2020-112827GB-I00 /AEI/ 10.13039/501100011033, RTI2018-096384-B-I00, RTC-2017-6389-5, RTC2019-007159-5]
  3. Fundacion Seneca del Centro de Coordinacion de la Investigacion de la Region de Murcia [20813/PI/18]
  4. Conselleria de Educacion, Investigacion, Cultura y Deporte, Direccio General de Ciencia i Investigacio, Proyectos AICO/2020, Spain [AICO/2020/302]

向作者/读者索取更多资源

Clustering algorithms are widely used kernels for generating knowledge from large datasets by grouping data elements into clusters to identify patterns or common features. Fuzzy clustering algorithms, while computationally expensive, show different performance depending on platforms due to the high computational cost and the variation in algorithmic patterns.
Clustering algorithms are one of the most widely used kernels to generate knowledge from large datasets. These algorithms group a set of data elements (i.e., images, points, patterns, etc.) into clusters to identify patterns or common features of a sample. However, these algorithms are very computationally expensive as they often involve the computation of expensive fitness functions that must be evaluated for all points in the dataset. This computational cost is even higher for fuzzy methods, where each data point may belong to more than one cluster. In this paper, we evaluate different parallelisation strategies on different heterogeneous platforms for fuzzy clustering algorithms typically used in the state-of-the-art such as the Fuzzy C-means (FCM), the Gustafson-Kessel FCM (GK-FCM) and the Fuzzy Minimals (FM). The experimental evaluation includes performance and energy trade-offs. Our results show that depending on the computational pattern of each algorithm, their mathematical foundation and the amount of data to be processed, each algorithm performs better on a different platform.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据