☆ 4.6 Article

Large margin clustering on uncertain data by considering probability distribution similarity

NEUROCOMPUTING (2015)

Journal

NEUROCOMPUTING

Volume 158, Issue -, Pages 81-89

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2015.02.002

Keywords

Clustering; Uncertain data; Probability density function; Large margin; Histogram intersection kernel

Funding

Research Grants Council of the Hong Kong Special Administrative Region, China [PolyU 5182/08E, PolyU 5191/09E]
National Natural Science Foundation of China [61105054, 61222210, 61308027, 71101096]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

In this paper, the problem of clustering uncertain objects whose locations are uncertain and described by probability density functions (pdf) is studied. Though some existing methods (i.e. K-means, DBSCAN) have been extended to handle uncertain object clustering, there are still some limitations to be solved. K-means assumes that the objects are described by reasonably separated spherical balls. Thus, UK-means based on K-means is limited in handling objects which are in non-spherical shape. On the other hand, the probability density function is an important characteristic of uncertain data, but few existing clustering methods consider the difference between objects relying on probability density functions. Therefore, in this article, a clustering algorithm based on probability distribution similarity is proposed. Our method aims at finding the largest margin between clusters to overcome the limitation of UK-means. Extensively experimental results verify the performance of our method by effectiveness, efficiency and scalability on both synthetic and real data sets. (C) 2015 Elsevier B.V. All rights reserved.

Large margin clustering on uncertain data by considering probability distribution similarity

Journal

NEUROCOMPUTING

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Large margin clustering on uncertain data by considering probability distribution similarity

Journal

NEUROCOMPUTING

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper