Article

Fuzzy C-Means clustering of incomplete data based on probabilistic information granules of missing values

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 99, Pages 51-70

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2016.01.048

Keywords

Fuzzy clustering; Incomplete data; Missing value; Probabilistic information granules; Alternating optimization

Funding

  1. National Natural Science Foundation of China [61472062]
  2. National Key Technology Support Program of China [2015BAF20B02]
  3. Canada Research Chair (CRC) Program and Natural Sciences and Engineering Council of Canada (NSERC)


Missing values are common in real-world data sets, and the analysis of incomplete data has become an active area of research. In this paper, we focus on clustering incomplete data, with the aim of introducing prior distribution information about the missing values into the fuzzy clustering algorithm. First, non-parametric hypothesis testing on the nearest neighbors of the incomplete data is used to describe missing values that adhere to a Gaussian distribution as probabilistic information granules. Second, we propose a novel clustering model in which the probabilistic information granules of missing values are incorporated into Fuzzy C-Means clustering of incomplete data through the maximum likelihood criterion. Third, the clustering model is optimized by a tri-level alternating optimization based on the method of Lagrange multipliers. The convergence and time complexity of the clustering algorithm are also discussed. Experiments on both synthetic and real-world data sets demonstrate that the proposed approach can effectively cluster incomplete data. © 2016 Elsevier B.V. All rights reserved.
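
The first step of the abstract (describing each missing value by a Gaussian estimated from nearest neighbors) can be illustrated with a minimal sketch. This is not the authors' implementation: it omits the non-parametric hypothesis test entirely, and the partial-distance measure, the function names `partial_distance` and `gaussian_granules`, and the parameter `k` are all assumptions made here for illustration.

```python
import math
from statistics import mean, pstdev


def partial_distance(a, b):
    """Euclidean distance over attributes observed in both records,
    rescaled by (total attributes / shared attributes).
    Missing values are encoded as None (an assumption of this sketch)."""
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf  # no common attribute, so the records are incomparable
    squared = sum((x - y) ** 2 for x, y in shared)
    return math.sqrt(squared * len(a) / len(shared))


def gaussian_granules(data, k=3):
    """Return {(row, col): (mean, std)} for every missing entry, estimated
    from the k nearest records that have that attribute observed.
    Hypothetical helper; the Gaussianity test from the abstract is skipped."""
    granules = {}
    for i, row in enumerate(data):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate neighbors: other records with attribute j observed.
            candidates = [
                (partial_distance(row, other), other[j])
                for m, other in enumerate(data)
                if m != i and other[j] is not None
            ]
            candidates = [c for c in candidates if math.isfinite(c[0])]
            candidates.sort(key=lambda c: c[0])
            neighbors = [v for _, v in candidates[:k]]
            if neighbors:
                granules[(i, j)] = (mean(neighbors), pstdev(neighbors))
    return granules


if __name__ == "__main__":
    data = [[1.0, 2.0], [1.1, 2.2], [0.9, None], [5.0, 6.0]]
    # The missing entry at (2, 1) gets a mean near 2.1 and a std near 0.1.
    print(gaussian_granules(data, k=2))
```

In the approach the abstract describes, such (mean, standard deviation) pairs would then enter the clustering objective through a likelihood term; that part is not sketched here.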
