4.7 Article

CCFS: A Confidence-Based Cost-Effective Feature Selection Scheme for Healthcare Data Classification

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2019.2903804

关键词

Feature extraction; Particle swarm optimization; Medical services; Classification algorithms; Genetic algorithms; Error analysis; Machine learning algorithms; Data classification; feature selection; binary particle swarm optimization; swarm intelligence

资金

  1. NSFC [61801240]
  2. QingLan Project of JiangSu Province

向作者/读者索取更多资源

The proposed Confidence-based and Cost-effective feature selection method (CCFS) utilizes BPSO to enhance healthcare data classification performance. By introducing a new updating mechanism and considering factors such as feature confidence, historical selection frequency, feature cost, and feature reduction ratio, the method has achieved promising experimental results.
Feature selection (FS) is one of the fundamental data processing techniques in various machine learning algorithms, especially for classification of healthcare data. However, it is a challenging issue due to the large search space. Binary Particle Swarm Optimization (BPSO) is an efficient evolutionary computation technique, and has been widely used in FS. In this paper, we proposed a Confidence-based and Cost-effective feature selection (CCFS) method using BPSO to improve the performance of healthcare data classification. Specifically, first, CCFS improves search effectiveness by developing a new updating mechanism that designs the feature confidence to explicitly take into account the fine-grained impact of each dimension in the particle on the classification performance. The feature confidence is composed of two measurements: the correlation between feature and categories, and historically selected frequency of each feature. Second, considering the fact that the acquisition costs of different features are naturally different, especially for medical data, and should be fully taken into account in practical applications, besides the classification performance, the feature cost and the feature reduction ratio are comprehensively incorporated into the design of fitness function. The proposed method has been verified in various UCI public datasets and compared with various benchmark schemes. The thoroughly experimental results show the effectiveness of the proposed method, in terms of accuracy and feature selection cost.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据