4.7 Article

Robust Dual Clustering with Adaptive Manifold Regularization

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume 29, Issue 11, Pages 2498-2509

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2017.2732986

Keywords

Clustering; matrix factorization; manifold regularization; dimension reduction

Funding

  1. National Natural Science Foundation of China [61711530239, 61471274]
  2. Hong Kong Government General Research Fund GRF [152202/14E]
  3. Hong Kong Scholars Program [XJ2015036]
  4. Australian Research Council [FL-170100117, DP-140102164, LP-150100671]

Ask authors/readers for more resources

In recent years, various data clustering algorithms have been proposed in the data mining and engineering communities. However, there are still drawbacks in traditional clustering methods which are worth to be further investigated, such as clustering for the high dimensional data, learning an ideal affinity matrix which optimally reveals the global data structure, discovering the intrinsic geometrical and discriminative properties of the data space, and reducing the noises influence brings by the complex data input. In this paper, we propose a novel clustering algorithm called robust dual clustering with adaptive manifold regularization (RDC), which simultaneously performs dual matrix factorization tasks with the target of an identical cluster indicator in both of the original and projected feature spaces, respectively. Among which, the l(2,1)-norm is used instead of the conventional l(2)-norm to measure the loss, which helps to improve the model robustness by relieving the influences by the noises and outliers. In order to better consider the intrinsic geometrical and discriminative data structure, we incorporate the manifold regularization term on the cluster indicator by using a particularly learned affinity matrix which is more suitable for the clustering task. Moreover, a novel augmented lagrangian method (ALM) based procedure is designed to effectively and efficiently seek the optimal solution of the proposed RDC optimization. Numerous experiments on the representative data sets demonstrate the superior performance of the proposed method compares to the existing clustering algorithms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available