4.6 Article

Feature selection with missing labels based on label compression and local feature correlation

Journal

NEUROCOMPUTING
Volume 395, Issue -, Pages 95-106

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2019.12.059

Keywords

Multi-label learning; Incomplete labels; Feature selection; Label compression; Local feature correlation

Funding

  1. Natural Science Foundation of China [61873214, 61872300 and61871020]
  2. Fundamental ResearchFunds for the Central Universities [XDJK2020B028, XDJK2019B024]
  3. National Key Research and Development Plan Task of China [2016YFC0901902]
  4. Natural Science Foundation of CQ CSTC [cstc2018jcyjAX0228]

Ask authors/readers for more resources

Feature selection can efficiently alleviate the issue of curse of dimensionality, especially for multi-label data with multiple features to embody diverse semantics. Although many supervised feature selection methods have been proposed, they generally assume the labels of training data are complete, whilst we only have data with incomplete labels in many real applications. Some methods try to select features with missing labels of training data, they still can not handle feature selection with a large and sparse label space. In addition, these approaches focus on global feature correlations, but some feature correlations are local and shared by a subset of data. In this paper, we introduce an approach called Feature Selection with missing labels based on Label Compression and Local feature Correlation (FSLCLC for short). FSLCLC adopts the low-rank matrix factorization on the sparse sample-label association matrix to compress labels and recover the missing labels in the compressed label space. In addition, it utilizes sparsity regularization and local feature correlation induced manifold regularizations to select the discriminative features. To solve the joint optimization objective for label compression, recovering missing labels and feature selection, we develop an iterative algorithm with guaranteed convergence. Experimental results on benchmark datasets show that the proposed FSLCLC outperforms the state-of-the-art multi-label feature selection algorithms. (C) 2020 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available