4.6 Article

Swarm Intelligence Algorithms in Text Document Clustering with Various Benchmarks

期刊

SENSORS
卷 21, 期 9, 页码 -

出版社

MDPI
DOI: 10.3390/s21093196

关键词

swarm intelligence algorithms; text document clustering; artificial intelligence; data mining

资金

  1. BK21 FOUR (Fostering Outstanding Universities for Research) - Ministry of Education (MOE, Korea)
  2. National Research Foundation of Korea (NRF)

向作者/读者索取更多资源

Text document clustering involves classifying textual documents into clusters based on content similarity. Swarm intelligence algorithms use simple rules to tackle complex tasks, with PSO and GWO algorithms outperforming K-means in document clustering.
Text document clustering refers to the unsupervised classification of textual documents into clusters based on content similarity and can be applied in applications such as search optimization and extracting hidden information from data generated by IoT sensors. Swarm intelligence (SI) algorithms use stochastic and heuristic principles that include simple and unintelligent individuals that follow some simple rules to accomplish very complex tasks. By mapping features of problems to parameters of SI algorithms, SI algorithms can achieve solutions in a flexible, robust, decentralized, and self-organized manner. Compared to traditional clustering algorithms, these solving mechanisms make swarm algorithms suitable for resolving complex document clustering problems. However, each SI algorithm shows a different performance based on its own strengths and weaknesses. In this paper, to find the best performing SI algorithm in text document clustering, we performed a comparative study for the PSO, bat, grey wolf optimization (GWO), and K-means algorithms using six data sets of various sizes, which were created from BBC Sport news and 20 newsgroups. Based on our experimental results, we discuss the features of a document clustering problem with the nature of SI algorithms and conclude that the PSO and GWO SI algorithms are better than K-means, and among those algorithms, the PSO performs best in terms of finding the optimal solution.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据