4.7 Article

Matrix representation of the conditional entropy for incremental feature selection on multi-source data

期刊

INFORMATION SCIENCES
卷 591, 期 -, 页码 263-286

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.01.037

关键词

Feature selection; Incremental larning; Rough sets; Matrix operators; Dynamic multi-source data

资金

  1. Youth Fund Project of Humanities and Social Science Research of Ministry of Education [21YJCZH045]
  2. National Science Foundation of China [61773324, 62076171]
  3. Fundamental Research Funds for the Central Universities [JBK2101001]
  4. Natural Science Foundation of Fujian Province [2020J01800]
  5. Joint Lab of Data Science and Business Intelligence at Southwestern University of Finance and Economics

向作者/读者索取更多资源

This paper proposes an incremental feature selection method based on the matrix representation of conditional entropy for dynamic multi-source data. A novel conditional entropy for multi-source data is introduced, and its properties are discussed. The matrix approach is employed to study the incremental mechanisms for computing conditional entropy.
In many real applications, the data are always collected from different information sources and are subject to evolve over time. Such data are referred to as dynamic multi-source data. How to efficiently select the informative features from dynamic multi-source data is a chal-lenging problem in data mining. Incremental feature selection with rough sets is an effec-tive method to select features from dynamic data. However, existing methods focus on single-source data and are not suitable for dynamic multi-source data with variations in data sources. To deal with this issue, we present an incremental feature selection method based on the matrix representation of the conditional entropy. We first propose a novel conditional entropy for multi-source data and discuss its properties, including the mono-tonicity and boundedness. Then, matrix characterization of the conditional entropy is pre-sented by employing the condition and decision relation matrices associated with some matrix operators. Finally, considering the addition and deletion of data sources in multi-source data, we employ the matrix approach to investigate the incremental mechanisms for the computation of the conditional entropy and develop the corresponding incremental feature selection algorithms. Extensive comparative experimental results are obtained to verify the effectiveness and efficiency of the proposed method. (c) 2022 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据