4.7 Article

Outlier-robust multi-view clustering for uncertain data

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 211, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2020.106567

Keywords

Multi-view clustering; Uncertain data; Density estimation; k-medoids; S-divergence

Funding

  1. project Prediction of diseases through computer assisted diagnosis system using images captured by minimally-invasive and non-invasive modalities'', Computer Science and Engineering, PDPM Indian Institute of Information Technology, Design and Manufacturin [SPARC-MHRD-231]

Ask authors/readers for more resources

Multi-view clustering is gaining more attention due to the presence of multiple views in real-world datasets, providing complementary and consensus information. An adaptive mixture similarity function based on geometric distance and S-divergence is introduced for uncertain data clustering, integrated with k-medoids to reduce the impact of outliers and noises. Extensive experimental results demonstrate the effectiveness and robustness of the proposed method against noise and outliers.
Nowadays, multi-view clustering is drawn more and more attention in the area of machine learning because real-world datasets frequently consist of multiple views. Moreover, it provides complementary and consensus information across multiple views. So, owing to the efficacy of revealing the concealed patterns in uncertain data, multiple views are considered in this study. But, a multi-view clustering algorithm is not alone sufficient to increase accuracy. A similarity measure is equally important in uncertain data clustering. However, existing similarity functions for clustering uncertain data afflict with several problems. Geometric distance-based similarity function cannot correctly capture the change between uncertain data with their distributions when they are massively location-wise overlapped. On the other hand, the divergence-based similarity function cannot discriminate against the change between various duos of absolutely disjointed uncertain data. Thus, a self-adaptive mixture similarity function based on geometric distance and S-divergence is introduced for uncertain data clustering. The proposed similarity function is integrated with k-medoids based multi-view clustering. The proposed method reduces the effect of outliers and noises since it uses the threshold-based residual objective function in k-medoids. Finally, extensive experimental results on synthetic and real-world uncertain datasets illustrate that the proposed method consistently defeats the state-of-the-art clustering algorithms. Experimental results also demonstrate the effectiveness and robustness of the proposed method against noise and outliers. (C) 2020 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available