Article

A Constrained Feature Selection Approach Based on Feature Clustering and Hypothesis Margin Maximization

Publisher

HINDAWI LTD
DOI: 10.1155/2021/5554873

Funding

  1. Image Processing and Computer Graphics Department at Ho Chi Minh City Open University in Vietnam

This paper introduces a semisupervised feature selection approach based on feature clustering and hypothesis margin maximization. It optimizes a semisupervised margin-based objective function to select the most discriminative feature from each feature cluster. Empirical validation on UCI benchmark datasets shows that it outperforms four other semisupervised and unsupervised methods and is competitive with two widely used supervised methods in terms of classification accuracy and representation entropy.
In this paper, we propose a semisupervised feature selection approach based on feature clustering and hypothesis margin maximization. The aim is to improve classification accuracy by choosing the right feature subset and to allow more interpretable models to be built. Our approach handles the two core aspects of feature selection, relevance and redundancy, and is divided into three steps. First, the similarity weights between features are represented by a sparse graph in which each feature is reconstructed as a sparse linear combination of the others. Second, the features are hierarchically clustered to identify groups of the most similar ones. Finally, a semisupervised margin-based objective function is optimized to select the most discriminative feature from within each cluster, thereby maximizing relevance while minimizing redundancy among the selected features. We empirically validate the proposed approach on multiple well-known UCI benchmark datasets in terms of classification accuracy and representation entropy, where it outperformed four other semisupervised and unsupervised methods and was competitive with two widely used supervised ones.
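
The three steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the objective function, so every concrete choice here is an assumption. Step 1 uses an L1-penalized least-squares fit solved by a few ISTA iterations, step 2 uses plain average-linkage agglomerative merging, and step 3 substitutes a Relief-style per-feature hypothesis margin computed on the labeled rows only for the paper's semisupervised margin objective. The function names, the `alpha` and `n_iter` parameters, and the convention that unlabeled rows carry the label `-1` are all hypothetical.

```python
import numpy as np


def standardize(X):
    """Zero-mean, unit-variance columns so features are comparable."""
    return (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-12)


def sparse_feature_graph(Z, alpha=0.1, n_iter=200):
    """Step 1 (assumed form): L1-penalized reconstruction of each feature
    from the others, solved with ISTA. Returns symmetric similarity weights."""
    n, d = Z.shape
    W = np.zeros((d, d))
    for j in range(d):
        others = [k for k in range(d) if k != j]
        A, target = Z[:, others], Z[:, j]
        step = 1.0 / (np.linalg.norm(A, 2) ** 2 + 1e-12)  # 1 / Lipschitz constant
        w = np.zeros(d - 1)
        for _ in range(n_iter):
            w = w - step * (A.T @ (A @ w - target))  # gradient step
            # soft-thresholding enforces sparsity
            w = np.sign(w) * np.maximum(np.abs(w) - step * alpha * n, 0.0)
        W[j, others] = np.abs(w)
    return (W + W.T) / 2.0


def cluster_features(W, n_clusters):
    """Step 2 (assumed form): average-linkage agglomerative clustering
    that repeatedly merges the two most similar feature groups."""
    clusters = [[j] for j in range(W.shape[0])]
    while len(clusters) > n_clusters:
        best, pair = -1.0, (0, 1)
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                sim = W[np.ix_(clusters[a], clusters[b])].mean()
                if sim > best:
                    best, pair = sim, (a, b)
        a, b = pair
        merged = clusters.pop(b)  # b > a, so index a stays valid
        clusters[a].extend(merged)
    return clusters


def margin_scores(Z, y):
    """Step 3 stand-in: Relief-style per-feature hypothesis margin on the
    labeled rows (y >= 0). Assumes at least two labeled rows per class."""
    labeled = np.flatnonzero(y >= 0)
    Zl, yl = Z[labeled], y[labeled]
    scores = np.zeros(Z.shape[1])
    for i in range(len(Zl)):
        dist = np.abs(Zl - Zl[i]).sum(axis=1)
        dist[i] = np.inf  # exclude the point itself
        hit = np.argmin(np.where(yl == yl[i], dist, np.inf))   # nearest same class
        miss = np.argmin(np.where(yl != yl[i], dist, np.inf))  # nearest other class
        scores += np.abs(Zl[i] - Zl[miss]) - np.abs(Zl[i] - Zl[hit])
    return scores / len(Zl)


def select_features(X, y, n_clusters):
    """Keep the highest-margin feature from each feature cluster."""
    Z = standardize(X)
    clusters = cluster_features(sparse_feature_graph(Z), n_clusters)
    scores = margin_scores(Z, y)
    return sorted(max(c, key=lambda f: scores[f]) for c in clusters)
```

In this sketch the unlabeled rows contribute only to the feature graph and the clustering, while the labels enter only through the margin score. Calling `select_features(X, y, n_clusters=k)` returns `k` column indices, one per cluster, which is how redundancy is limited while relevance is maximized within each group.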
