☆ 4.7 Article

Information-Theoretic Feature Selection via Tensor Decomposition and Submodularity

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2021)

期刊

IEEE TRANSACTIONS ON SIGNAL PROCESSING

卷 69, 期 -, 页码 6195-6205

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TSP.2021.3125147

关键词

Tensors; Feature extraction; Mutual information; Random variables; Training; Task analysis; Mathematical models; Probability; tensor decomposition; feature selection; mutual information; submodular maximization

类别

Engineering, Electrical & Electronic

资金

NSF [IIS-1704074]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper introduces a method of feature selection through low-rank tensor modeling to mitigate complexity and maximize classification performance. By learning the "principal components" of the joint distribution to avoid the curse of dimensionality, and by using a greedy algorithm to tackle the feature selection problem.

Feature selection by maximizing high-order mutual information between the selected feature vector and a target variable is the gold standard in terms of selecting the best subset of relevant features that maximizes the performance of prediction models. However, such an approach typically requires knowledge of the multivariate probability distribution of all features and the target, and involves a challenging combinatorial optimization problem. Recent work has shown that any joint Probability Mass Function (PMF) can be represented as a naive Bayes model, via Canonical Polyadic (tensor rank) Decomposition. In this paper, we introduce a low-rank tensor model of the joint PMF of all variables and indirect targeting as a way of mitigating complexity and maximizing the classification performance for a given number of features. Through low-rank modeling of the joint PMF, it is possible to circumvent the curse of dimensionality by learning 'principal components' of the joint distribution. By indirectly aiming to predict the latent variable of the naive Bayes model instead of the original target variable, it is possible to formulate the feature selection problem as maximization of a monotone submodular function subject to a cardinality constraint - which can be tackled using a greedy algorithm that comes with performance guarantees. Numerical experiments with several standard datasets suggest that the proposed approach compares favorably to the state-of-art for this important problem.

Information-Theoretic Feature Selection via Tensor Decomposition and Submodularity

期刊

IEEE TRANSACTIONS ON SIGNAL PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Information-Theoretic Feature Selection via Tensor Decomposition and Submodularity

期刊

IEEE TRANSACTIONS ON SIGNAL PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文