Journal
EXPERT SYSTEMS WITH APPLICATIONS
Volume 176, Issue -, Pages -Publisher
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.114778
Keywords
Feature selection; Harris hawks optimization; Sine-cosine algorithm; High-dimensional data; Optimization problems
Categories
Funding
- University of Electronic Science and Technology of China (UESTC)
- Minia University - National Natural Science Foundation of China (NSFC) [61772120]
Ask authors/readers for more resources
Feature selection is crucial in data mining, and the SCHHO hybrid optimization method proposed in this paper shows promising results in terms of efficient search and improved accuracy. The method integrates different search strategies to tackle the optimization problem in numerical optimization and feature selection. The experimental and statistical analyses demonstrate the effectiveness of SCHHO in reducing feature-size and achieving high accuracy without additional computational cost. Potential future directions for research are highlighted based on the findings of this study.
Feature selection, an optimization problem, becomes an important pre-process tool in data mining, which simultaneously aims at minimizing feature-size and maximizing model generalization. Because of large search space, conventional optimization methods often fail to generate global optimum solution. A variety of hybrid techniques merging different search strategies have been proposed in feature selection literature, but mostly deal with low dimensional datasets. In this paper, a hybrid optimization method is proposed for numerical optimization and feature selection, which integrates sine-cosine algorithm (SCA) in Harris hawks optimization (HHO). The goal of SCA integration is to cater ineffective exploration in HHO, moreover exploitation is enhanced by dynamically adjusting candidate solutions for avoiding solution stagnancy in HHO. The proposed method, namely SCHHO, is evaluated by employing CEC?17 test suite for numerical optimization and sixteen datasets with low and high-dimensions exceeding 15000 attributes, and compared with original SCA and HHO, as well as, other well-known optimization methods like dragonfly algorithm (DA), whale optimization algorithm (WOA), grasshopper optimization algorithm (GOA), Grey wolf optimization (GWO), and salp swarm algorithm (SSA); in addition to state-of-the-art methods. Performance of the proposed method is also validated against hybrid methods proposed in recent related literature. The extensive experimental and statistical analyses suggest that the proposed hybrid variant of HHO is able to produce efficient search results without additional computational cost. With increased convergence speed, SCHHO reduced feature-size up to 87% and achieved accuracy up to 92%. Motivated from the findings of this study, various potential future directions are also highlighted.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available