4.6 Article

A self-adaptive level-based learning artificial bee colony algorithm for feature selection on high-dimensional classification

Journal

SOFT COMPUTING
Volume 26, Issue 18, Pages 9665-9687

Publisher

SPRINGER
DOI: 10.1007/s00500-022-06826-1

Keywords

Artificial bee colony algorithm; Feature selection; Classification; High dimensionality

Funding

  1. Special Funds of Science and Technology Innovation Project of Fujian Agriculture and Forestry University [KFA17030A, KFA17181A]
  2. Natural Science Foundation of Fujian Province [2017J01607, 2018J01612]
  3. Forestry Science and Technology Projects in Fujian Province (Memorandums 26)
  4. Fund for Excellent Dissertation in Fujian Agriculture and Forestry University, China [1122YS01004]

This paper proposes a self-adaptive level-based learning artificial bee colony (SLLABC) algorithm for high-dimensional feature selection problems. The algorithm introduces novel mechanisms to accelerate convergence, balance exploration and exploitation abilities, and reduce the number of selected features. Experimental results show that the proposed SLLABC algorithm achieves competitive performance in terms of classification accuracy and feature subset size.
Feature selection is an important data preprocessing method in data mining and machine learning, yet it faces the curse of dimensionality when dealing with high-dimensional data. In this paper, a self-adaptive level-based learning artificial bee colony (SLLABC) algorithm is proposed for high-dimensional feature selection problems. The SLLABC algorithm includes three new mechanisms: (1) A novel level-based learning mechanism is introduced to accelerate the convergence of the basic artificial bee colony algorithm. It divides the population into several levels, and the individuals on each level learn from the individuals on higher levels; the individuals on the highest level learn from each other. (2) A self-adaptive method is proposed to keep the balance between exploration and exploitation abilities; it takes the diversity of the population into account to determine the number of levels. The lower the diversity, the fewer levels the population is divided into. (3) A new update mechanism is proposed to reduce the number of selected features. In this mechanism, if the error rate of an offspring is higher than that of its parent, or is equal to it but the offspring selects more features, then the offspring is discarded and the parent is retained; otherwise, the offspring replaces its parent. Further, we discuss and analyze the contribution of these novelties to the diversity of the population and the classification performance. Finally, the results, compared with 8 state-of-the-art algorithms on 12 high-dimensional datasets, confirm the competitive performance of the proposed SLLABC in both classification accuracy and the size of the feature subset.
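
As an illustration only, the update mechanism (3) and the level structure of mechanism (1) might be sketched as follows. This is a minimal sketch based on one reading of the abstract, not the authors' implementation. The names `Candidate`, `survivor`, `split_into_levels`, and `exemplar_pool` are hypothetical. Ranking individuals by error rate and cutting the ranking into near-equal contiguous levels are assumptions, as is reading the discard condition as "higher error, or equal error with more features". The self-adaptive choice of the number of levels and the learning equations themselves are not specified in the abstract and are left out.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Candidate:
    """A feature subset (binary mask) with its measured error rate."""
    mask: Tuple[int, ...]   # 1 = feature selected, 0 = not selected
    error_rate: float       # classification error, lower is better

    @property
    def n_selected(self) -> int:
        return sum(self.mask)


def survivor(parent: Candidate, offspring: Candidate) -> Candidate:
    """Keep the parent if the offspring has a higher error rate, or the same
    error rate with more selected features; otherwise keep the offspring."""
    worse = offspring.error_rate > parent.error_rate
    tie_but_larger = (offspring.error_rate == parent.error_rate
                      and offspring.n_selected > parent.n_selected)
    return parent if (worse or tie_but_larger) else offspring


def split_into_levels(population: List[Candidate],
                      n_levels: int) -> List[List[Candidate]]:
    """Assumption: rank by error rate and cut into contiguous, near-equal
    levels. Index 0 is the highest (best) level."""
    ranked = sorted(population, key=lambda c: c.error_rate)
    n_levels = max(1, min(n_levels, len(ranked)))
    size, extra = divmod(len(ranked), n_levels)
    levels, start = [], 0
    for i in range(n_levels):
        end = start + size + (1 if i < extra else 0)
        levels.append(ranked[start:end])
        start = end
    return levels


def exemplar_pool(levels: List[List[Candidate]], level_index: int,
                  individual: Candidate) -> List[Candidate]:
    """Individuals that `individual` may learn from: every higher level, or
    its own peers when it already sits on the highest level."""
    if level_index == 0:
        return [c for c in levels[0] if c is not individual]
    return [c for level in levels[:level_index] for c in level]
```

Under this reading, an offspring that ties its parent on error rate replaces it only when it does not select more features, which is how the rule pushes the population toward smaller subsets.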