4.6 Article

A self-adaptive level-based learning artificial bee colony algorithm for feature selection on high-dimensional classification

Journal

SOFT COMPUTING
Volume 26, Issue 18, Pages 9665-9687

Publisher

SPRINGER
DOI: 10.1007/s00500-022-06826-1

Keywords

Artificial bee colony algorithm; Feature selection; Classification; High dimensionality

Funding

  1. Special Funds of Science and Technology Innovation Project of Fujian Agriculture and Forestry University [KFA17030A, KFA17181A]
  2. Natural Science Foundation of Fujian Province [2017J01607, 2018J01612]
  3. Forestry Science and Technology Projects in Fujian Province (Memorandums 26)
  4. Fund for Excellent Dissertation in Fujian Agriculture and Forestry University, China [1122YS01004]

This paper proposes a self-adaptive level-based learning artificial bee colony (SLLABC) algorithm for the high-dimensional feature selection problem. The algorithm introduces novel mechanisms to accelerate convergence, balance exploration and exploitation, and reduce the number of selected features. Experimental results show that the proposed SLLABC algorithm achieves competitive performance in terms of classification accuracy and feature subset size.
Feature selection is an important data preprocessing method in data mining and machine learning, yet it suffers from the curse of dimensionality when dealing with high-dimensional data. In this paper, a self-adaptive level-based learning artificial bee colony (SLLABC) algorithm is proposed for high-dimensional feature selection problems. The SLLABC algorithm includes three new mechanisms: (1) A novel level-based learning mechanism is introduced to accelerate the convergence of the basic artificial bee colony algorithm. It divides the population into several levels, and the individuals on each level learn from the individuals on higher levels; the individuals on the highest level learn from each other. (2) A self-adaptive method is proposed to balance exploration and exploitation; it takes the diversity of the population into account to determine the number of levels. The lower the diversity, the fewer levels the population is divided into. (3) A new update mechanism is proposed to reduce the number of selected features. In this mechanism, if the error rate of an offspring is higher than that of its parent, or equal to it while selecting more features, the offspring is discarded and the parent is retained; otherwise, the offspring replaces its parent. Further, we discuss and analyze the contribution of these novelties to population diversity and classification performance. Finally, comparisons with 8 state-of-the-art algorithms on 12 high-dimensional datasets confirm the competitive performance of the proposed SLLABC in both classification accuracy and feature subset size.
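
The update mechanism in (3) can be sketched as a small parent-versus-offspring comparison. This is a minimal illustration of one reading of the abstract, not the authors' implementation: the names `Candidate` and `select_survivor` are hypothetical, a solution is assumed to be a boolean feature mask with a precomputed error rate, and the rule is read as "discard the offspring if its error is higher, or if its error is equal and it selects more features".

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Candidate:
    """Hypothetical container for one solution: a feature mask and its error rate."""
    mask: Tuple[bool, ...]   # True means the feature is selected
    error_rate: float        # classification error, assumed to be computed elsewhere

    @property
    def n_selected(self) -> int:
        # Number of selected features in the mask.
        return sum(self.mask)


def select_survivor(parent: Candidate, offspring: Candidate) -> Candidate:
    """Return whichever candidate is kept under the assumed reading of the rule."""
    # Offspring is strictly worse on error: keep the parent.
    if offspring.error_rate > parent.error_rate:
        return parent
    # Same error but more selected features: keep the parent.
    if (offspring.error_rate == parent.error_rate
            and offspring.n_selected > parent.n_selected):
        return parent
    # Otherwise (lower error, or equal error with no more features): offspring replaces parent.
    return offspring


parent = Candidate(mask=(True, True, False, True), error_rate=0.10)
smaller = Candidate(mask=(True, False, False, True), error_rate=0.10)
larger = Candidate(mask=(True, True, True, True), error_rate=0.10)

print(select_survivor(parent, smaller) is smaller)  # equal error, fewer features
print(select_survivor(parent, larger) is parent)    # equal error, more features
```

Under this reading, an offspring that ties its parent on both error rate and subset size still replaces it, because the abstract sends every remaining case to "the offspring replaces its parent".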
