4.6 Article

Feature selection with effective distance

Journal

NEUROCOMPUTING
Volume 215, Issue -, Pages 100-109

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2015.07.155

Keywords

Feature selection; Effective distance; Classification; Clustering; Sparse representation

Funding

  1. National Natural Science Foundation of China [61422204, 61473149, 61402320]
  2. Jiangsu Natural Science Foundation [BK20130034]
  3. Specialized Research Fund for the Doctoral Program of Higher Education [20123218110009]
  4. NUAA Fundamental Research Funds [NE2013105]
  5. Natural Science Foundation of Shandong Province of China [ZR2013FQ029]

Ask authors/readers for more resources

As more features are introduced in pattern recognition and machine learning applications, feature selection remains a critically important task to find the most compact representation of data, especially in unsupervised learning scenarios without enough class labels. Although there are a number of unsupervised feature selection methods in the literature, most of existing methods only focus on using conventional distances (e.g., Euclidean distance) to measure the similarity between two samples, which could not capture the dynamic structure of data due to the static characteristics of conventional distances. To reflect the dynamic structure of data, in this paper, we propose a set of effective distance-based feature selection methods, where a probabilistically motivated effective distance is used to measure the similarity of samples. Specifically, we first develop a sparse representation-based algorithm to compute the effective distance. Then, we propose three new filter-type unsupervised feature selection methods using effective distance, including an effective distance-based Laplacian Score (EDLS), and two effective distance-based Sparsity Scores (i.e., EDSS-1, and EDSS-2). Experimental results of clustering and classification tasks on a series of benchmark data sets show that our effective distance-based feature selection methods can achieve better performance than conventional methods using Euclidean distance. (C) 2016 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available