4.6 Article

Large margin clustering on uncertain data by considering probability distribution similarity

Journal

NEUROCOMPUTING
Volume 158, Issue -, Pages 81-89

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2015.02.002

Keywords

Clustering; Uncertain data; Probability density function; Large margin; Histogram intersection kernel

Funding

  1. Research Grants Council of the Hong Kong Special Administrative Region, China [PolyU 5182/08E, PolyU 5191/09E]
  2. National Natural Science Foundation of China [61105054, 61222210, 61308027, 71101096]

Ask authors/readers for more resources

In this paper, the problem of clustering uncertain objects whose locations are uncertain and described by probability density functions (pdf) is studied. Though some existing methods (i.e. K-means, DBSCAN) have been extended to handle uncertain object clustering, there are still some limitations to be solved. K-means assumes that the objects are described by reasonably separated spherical balls. Thus, UK-means based on K-means is limited in handling objects which are in non-spherical shape. On the other hand, the probability density function is an important characteristic of uncertain data, but few existing clustering methods consider the difference between objects relying on probability density functions. Therefore, in this article, a clustering algorithm based on probability distribution similarity is proposed. Our method aims at finding the largest margin between clusters to overcome the limitation of UK-means. Extensively experimental results verify the performance of our method by effectiveness, efficiency and scalability on both synthetic and real data sets. (C) 2015 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available