4.6 Article

Feature selection using self-information and entropy-based uncertainty measure for fuzzy neighborhood rough set

期刊

COMPLEX & INTELLIGENT SYSTEMS
卷 8, 期 1, 页码 287-305

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s40747-021-00356-3

关键词

Fuzzy neighborhood rough set; Feature selection; Self-information; Fuzzy neighborhood joint entropy; Uncertainty measure

资金

  1. National Natural Science Foundation of China [61976082, 61976120, 62002103]
  2. Key Scientific and Technological Projects of Henan Province [202102210165]

向作者/读者索取更多资源

Feature selection based on the fuzzy neighborhood rough set model (FNRS) is popular in data mining, but it may lead to the loss of information due to the dependency function only considering the lower approximation of the decision. This paper proposes a fuzzy neighborhood joint entropy model (FNSIJE) to address this problem, introducing uncertain fuzzy neighborhood self-information measures of decision variables and an uncertainty measure based on fuzzy neighborhood joint entropy for feature selection. The model shows better classification performance and can reduce dimensionality effectively.
Feature selection based on the fuzzy neighborhood rough set model (FNRS) is highly popular in data mining. However, the dependent function of FNRS only considers the information present in the lower approximation of the decision while ignoring the information present in the upper approximation of the decision. This construction method may lead to the loss of some information. To solve this problem, this paper proposes a fuzzy neighborhood joint entropy model based on fuzzy neighborhood self-information measure (FNSIJE) and applies it to feature selection. First, to construct four uncertain fuzzy neighborhood self-information measures of decision variables, the concept of self-information is introduced into the upper and lower approximations of FNRS from the algebra view. The relationships between these measures and their properties are discussed in detail. It is found that the fourth measure, named tolerance fuzzy neighborhood self-information, has better classification performance. Second, an uncertainty measure based on the fuzzy neighborhood joint entropy has been proposed from the information view. Inspired by both algebra and information views, the FNSIJE is proposed. Third, the K-S test is used to delete features with weak distinguishing performance, which reduces the dimensionality of high-dimensional gene datasets, thereby reducing the complexity of high-dimensional gene datasets, and then, a forward feature selection algorithm is provided. Experimental results show that compared with related methods, the presented model can select less important features and have a higher classification accuracy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据