4.7 Article

Multi-granularity feature selection on cost-sensitive data with measurement errors and variable costs

期刊

KNOWLEDGE-BASED SYSTEMS
卷 158, 期 -, 页码 25-42

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.knosys.2018.05.020

关键词

Feature-granularity selection; Measurement errors; Multi-granularity; Neighborhood; Rough sets; Variable costs

资金

  1. National Natural Science Foundation of China [61672332, 61603173]
  2. Natural Science Foundation of Fujian Province, China [2016J01315, 2017J01771]
  3. Education Department of Fujian Province [JAT160291]
  4. Institute of Meteorological Big Data-Digital Fujian
  5. Fujian Key Laboratory of Data Science and Statistics

向作者/读者索取更多资源

In real applications of data mining, machine learning and granular computing, measurement errors, test costs and misclassification costs often occur. Furthermore, the test cost of a feature is usually variable with the error range, and the variability of the misclassification cost is related to the object considered. Recently, some approaches based on rough sets have been introduced to study the error-based cost-sensitive feature selection problem. However, most of them consider only single-granularity cases, thus are not feasible for the case where the granularity diversity between different features should be taken into account. Motivated by this problem, we propose a multi-granularity feature selection approach which considers measurement errors and variable costs in terms of feature-value granularities. For a given feature, the feature-value granularity is evaluated by the error confidence level of the feature values. In this way, we build a theoretic framework called confidence-level vector-based neighborhood rough set, and present a so-called heuristic feature-granularity selection algorithm, and a relevant competition strategy which can select both features and their respective feature-value granularities effectively and efficiently. Experiment results show that a satisfactory trade-off among feature dimension reduction, feature-value granularity selection and total cost minimization can be achieved by the proposed approach. This work would provide a new insight into the cost-sensitive feature selection problem from the multi-granularity perspective.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据