☆ 4.5 Article

Cost-sensitive feature selection on multi-label data via neighborhood granularity and label enhancement

APPLIED INTELLIGENCE (2021)

期刊

APPLIED INTELLIGENCE

卷 51, 期 4, 页码 2210-2232

出版社

SPRINGER

DOI: 10.1007/s10489-020-01993-w

关键词

Feature selection; Cost-sensitive; Label enhancement; Neighborhood granularity; Multi-label data

类别

Computer Science, Artificial Intelligence

资金

National Natural Science Foundation of China [61966016, 61662023]
Natural Science Foundation of Jiangxi Province [20192BAB207018]
Scientific Research Project of Education department of Jiangxi Province [GJJ180200]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study proposes a novel test cost multi-label feature selection algorithm that enhances traditional logical labels into label distribution forms using neighborhood granularity and considers the effect of test cost under different distributions. Experimental results demonstrate that the algorithm achieves superior performance on multiple publicly available datasets.

Multi-label feature selection, which is an efficient and effective pre-processing step in machine learning and data mining, can select a feature subset that contains more contributions for multi-label classification while improving the performance of the classifiers. In real-world applications, an instance may be associated with multiple related labels with different relative importances, and the process of obtaining different features usually requires different costs, containing money, and time, etc. However, most existing works with regard to multi-label feature selection do not take into consideration the above two critical issues simultaneously. Therefore, in this paper, we exploit the idea of neighborhood granularity to enhance the traditional logical labels into label distribution forms to excavate the deeper supervised information hidden in multi-label data, and further consider the effect of the test cost under three different distributions, simultaneously. Motivated by these issues, a novel test cost multi-label feature selection algorithm with label enhancement and neighborhood granularity is designed. Moreover, the proposed algorithm is tested upon ten publicly available benchmark multi-label datasets with six widely-used metrics from two different aspects. Finally, two groups of experimental results demonstrate that the proposed algorithm achieves the satisfactory and superior performance over other four state-of-the-art comparing algorithms, and it is effective for improving the learning performance and decreasing the total test costs of the selected feature subset.

Cost-sensitive feature selection on multi-label data via neighborhood granularity and label enhancement

期刊

APPLIED INTELLIGENCE

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Cost-sensitive feature selection on multi-label data via neighborhood granularity and label enhancement

期刊

APPLIED INTELLIGENCE

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文