4.7 Article

RT-GSOM: Rough tolerance growing self-organizing map

Journal

INFORMATION SCIENCES
Volume 566, Issue -, Pages 19-37

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2021.01.039

Keywords

Clustering algorithm; unsupervised learning; indiscernible reducts; rough tolerance information; GSOM

Funding

  1. Ministry of Education amp
  2. Ministry of Steel (GOI)
  3. Tata Steel Limited

Ask authors/readers for more resources

RT-GSOM is a novel algorithm that introduces the concept of rough tolerance set to reduce uncertainty in decision-making, address information loss and overlapping patterns of decision classes. By allowing the network to grow based on extracted data in an unsupervised manner, it demonstrates superior learning rate and cluster quality over other algorithms for both categorical and continuous data.
The concept of rough tolerance set is introduced within growing self-organizing map (GSOM) to reduce the uncertainty in decision-making by developing a new algorithm, namely rough tolerance GSOM (RT-GSOM). This algorithm aims to address the issues of (i) identifying the suitable size of clusters in SOM, (ii) information loss in rough SOM (RSOM), and (iii) uncertainty arising from the overlapping patterns of decision classes in GSOM. In RT-GSOM, the network is allowed to grow based on indiscernible reducts and tol-erance thresholds extracted from data in an unsupervised way. The network is initialized with the samples extracted from these reducts, and the weights are initialized with ran-dom category index. For each decision class, one set of indiscernible reducts is obtained. The tolerance threshold for each decision class is defined using the average distance among all the samples present in the reduct set corresponding to the same class. The superiority of RT-GSOM is demonstrated over twelve benchmark datasets (both categorical and continu-ous) obtained from UCI machine learning repository. Results reveal that RT-GSOM is effi-cient than some state-of-the-art algorithms in terms of learning rate, and quality of clusters for both categorical, and continuous data. (c) 2021 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available