4.7 Article

A self-adaptive weighted differential evolution approach for large-scale feature selection

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 235, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2021.107633

Keywords

Feature selection; Differential evolution; High-dimensional data; Classification; Self-adaptive; Multi-population

Funding

  1. National Natural Science Foundation of China [62076109, 32000464]
  2. Natural Science Foundation of Jilin Province [20190103006JH]
  3. Research Grants Council of the Hong Kong Special Ad-ministrative Region [CityU 11200218]
  4. Health and Medical Research Fund
  5. Food and Health Bureau, The Government of the Hong Kong Special Administrative Region [07181426]
  6. Hong Kong Institute for Data Science (HKIDS) at City University of Hong Kong
  7. City University of Hong Kong [CityU 11202219, CityU 11203520]

Ask authors/readers for more resources

This paper introduces a novel weighted differential evolution algorithm named SaWDE to address large-scale feature selection problems. The algorithm utilizes a multi-population mechanism, a new self-adaptive mechanism, and a weighted model to identify important features, demonstrating superior performance on twelve large-scale datasets.
Recently, many evolutionary computation methods have been developed to solve the feature selection problem. However, the studies focused mainly on small-scale issues, resulting in stagnation issues in local optima and numerical instability when dealing with large-scale feature selection dilemmas. To address these challenges, this paper proposes a novel weighted differential evolution algorithm based on self-adaptive mechanism, named SaWDE, to solve large-scale feature selection. First, a multi -population mechanism is adopted to enhance the diversity of the population. Then, we propose a new self-adaptive mechanism that selects several strategies from a strategy pool to capture the diverse characteristics of the datasets from the historical information. Finally, a weighted model is designed to identify the important features, which enables our model to generate the most suitable feature-selection solution. We demonstrate the effectiveness of our algorithm on twelve large-scale datasets. The performance of SaWDE is superior compared to six non-EC algorithms and six other EC algorithms, on both training and test datasets and on subset size, indicating that our algorithm is a favorable tool to solve the large-scale feature selection problem. Moreover, we have experimented SaWDE with six EC algorithms on twelve higher-dimensional data, which demonstrates that SaWDE is more robust and efficient compared to those state-of-the-art methods. SaWDE source code is available on Github at https://github.com/wangxb96/SaWDE. (C) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available