4.7 Article

Selecting feature subset with sparsity and low redundancy for unsupervised learning

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 86, Pages 210-223

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2015.06.008

Keywords

Unsupervised feature selection; Nonnegative spectral analysis; Sparsity and low redundancy

Funding

  1. National Natural Science Foundation of China [61303179]
  2. Hundred Talents Program (Chinese Academy of Sciences) [Y3S4011D31]

Feature selection techniques are attracting increasing attention as more domains produce high-dimensional data. Because class labels are often unavailable, many researchers focus on the unsupervised scenario, attempting to find an optimal feature subset that preserves the original data distribution. However, existing methods either fail to achieve sparsity or ignore the potential redundancy among features. In this paper, we propose a novel unsupervised feature selection algorithm that retains this preserving power while achieving high sparsity and low redundancy in a unified manner. On the one hand, to preserve the data structure of the whole feature set, we build the graph Laplacian matrix and learn pseudo class labels through spectral analysis. By finding a feature weight matrix, we can map the original data into a low-dimensional space based on the pseudo labels. On the other hand, to ensure sparsity and low redundancy simultaneously, we introduce a novel regularization term into the objective function under nonnegative constraints; the term can be viewed as a combination of the matrix norms $\|\cdot\|_{m1}$ and $\|\cdot\|_{m2}$ on the feature weights. An iterative multiplicative algorithm with proven convergence is designed to solve the constrained optimization problem efficiently. Extensive experimental results on different real-world data sets demonstrate the promising performance of the proposed method compared with state-of-the-art methods. © 2015 Elsevier B.V. All rights reserved.
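
The first half of the pipeline described above (graph Laplacian, spectral pseudo labels, feature weight matrix) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption rather than the paper's method: the k-nearest-neighbour graph, the normalized Laplacian, the synthetic data, and all function names are hypothetical, and plain ridge regression is swapped in for the paper's nonnegative $\|\cdot\|_{m1}$ / $\|\cdot\|_{m2}$ regularizer and multiplicative updates, which the abstract does not specify.

```python
import numpy as np

def knn_affinity(X, k=5):
    """Symmetric 0/1 k-nearest-neighbour affinity matrix (assumed graph choice)."""
    n = X.shape[0]
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)  # squared distances
    np.fill_diagonal(d2, np.inf)                      # exclude self-neighbours
    idx = np.argsort(d2, axis=1)[:, :k]               # k nearest per row
    A = np.zeros((n, n))
    A[np.repeat(np.arange(n), k), idx.ravel()] = 1.0
    return np.maximum(A, A.T)                         # symmetrize

def normalized_laplacian(A):
    """L = I - D^{-1/2} A D^{-1/2}; every degree is >= k, so no division by zero."""
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    return np.eye(len(A)) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]

def spectral_pseudo_labels(L, c):
    """Eigenvectors of the c smallest eigenvalues, used as relaxed pseudo labels."""
    _, vecs = np.linalg.eigh(L)                       # eigenvalues ascending
    return vecs[:, :c]

def rank_features(X, Y, alpha=1.0):
    """Ridge regression of pseudo labels on features.

    This is a stand-in, NOT the paper's regularizer: it yields a weight
    matrix W (features x pseudo classes) and ranks features by row norm.
    """
    d = X.shape[1]
    W = np.linalg.solve(X.T @ X + alpha * np.eye(d), X.T @ Y)
    scores = np.linalg.norm(W, axis=1)
    return np.argsort(-scores), W

# Synthetic data (assumed): two clusters separated along features 0 and 1,
# plus four pure-noise features.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 6))
X[:20, :2] += 3.0
X[20:, :2] -= 3.0
X -= X.mean(axis=0)                                   # center the features

A = knn_affinity(X, k=5)
L = normalized_laplacian(A)
Y = spectral_pseudo_labels(L, c=2)
order, W = rank_features(X, Y, alpha=1.0)
print("features ranked by weight-row norm:", order)
```

The row norms of `W` play the role of per-feature importance; in the paper's formulation, the sparsity and low-redundancy regularizer would shape these rows instead of the ridge penalty used here.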
