A weighting k-modes algorithm for subspace clustering of categorical data

NEUROCOMPUTING (2013)

Journal

NEUROCOMPUTING

Volume 108, Pages 23-30

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2012.11.009

Keywords

Subspace clustering; Weight; k-Modes algorithm; Categorical data

Funding

National Natural Science Foundation of China [71031006, 70971080, 60970014]
Special Prophase Project on National Key Basic Research and Development Program of China (973) [2011CB311805]
Natural Science Foundation of Shanxi [2010021016-2, 2010011021-1]
China Postdoctoral Science Foundation [2012M510046]

Abstract

Traditional clustering algorithms treat all dimensions of an input data set equally. In high-dimensional data, however, data points are commonly clustered in subspaces, meaning that classes of objects are categorized in subspaces rather than in the entire space. Subspace clustering is an extension of traditional clustering that seeks to find clusters in different subspaces within a data set. In this paper, a weighting k-modes algorithm is presented for subspace clustering of categorical data, and its time complexity is analyzed. The proposed algorithm adds a step to the k-modes clustering process that automatically computes the weight of every dimension in each cluster by using complement entropy. The attribute weights can then be used to identify the subsets of important dimensions that categorize different clusters. The effectiveness of the proposed algorithm is demonstrated on real and synthetic data sets. © 2012 Elsevier B.V. All rights reserved.
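The weighting step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's weight formula, so everything here is an assumption for illustration: complement entropy is taken in its usual form, the sum over attribute values of p(1 - p), and the mapping from entropy to weight (one minus the entropy, normalised to sum to 1) is a hypothetical stand-in. The function names `complement_entropy`, `attribute_weights` and `weighted_mismatch` are likewise invented for this example.

```python
from collections import Counter


def complement_entropy(values):
    """Complement entropy of one categorical attribute: sum_v p_v * (1 - p_v)."""
    n = len(values)
    counts = Counter(values)
    return sum((c / n) * (1 - c / n) for c in counts.values())


def attribute_weights(cluster):
    """Per-attribute weights for one cluster (a list of equal-length tuples).

    Assumption, not the paper's formula: an attribute with lower complement
    entropy inside the cluster is treated as more important, using
    score = 1 - entropy, normalised so the weights sum to 1.
    """
    columns = list(zip(*cluster))  # one tuple of values per attribute
    scores = [1.0 - complement_entropy(col) for col in columns]
    total = sum(scores)  # always > 0 because complement entropy is < 1
    return [s / total for s in scores]


def weighted_mismatch(obj, mode, weights):
    """Weighted simple-matching dissimilarity between an object and a mode."""
    return sum(w for x, m, w in zip(obj, mode, weights) if x != m)


# Toy cluster: attribute 0 is constant, attribute 1 is split evenly.
cluster = [("a", "x"), ("a", "y"), ("a", "x"), ("a", "y")]
weights = attribute_weights(cluster)
print(weights)
```

In this toy cluster the constant first attribute receives the larger weight, which matches the idea that the weights point to the dimensions that characterise a cluster. A full algorithm would recompute these weights inside each k-modes iteration, after the cluster modes are updated.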
