4.7 Article

Optimizing convolutional neural networks architecture using a modified particle swarm optimization for image classification

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 229, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.120411

Keywords

Convolutional neural networks; Neural network design; Hyperparameters; Particle swarm optimization; Image classification

Ask authors/readers for more resources

In this work, a method is proposed to automatically generate the best CNN design, using a particle swarm optimization algorithm. A novel strategy is developed to determine the updated position of each particle without relying on the velocity equation. Experimental results show that the proposed method can quickly find effective CNN architectures with comparable performance to the best designs currently available.
Although Convolutional Neural Networks (CNNs) have been shown to be highly effective in image classification tasks, designing their architecture to achieve optimal results is often challenging. This process is time consum-ing, requires significant effort and expertise, and is complicated by the large number of hyperparameters. To address this problem, in this work we propose an approach that reduces human intervention and automatically generates the best CNN design. Our approach uses a variant of Particle Swarm Optimization (PSO), called Particle Swarm Optimization without Velocity (PSWV), to speed up convergence and reduce the number of iterations required to determine the optimal CNN hyperparameters. We developed a novel strategy to determine the updated position of each particle using a linear combination of the best position of the particle and the best position of the swarm without relying on the velocity equation. Our algorithm harnesses the power of the variable-length encoding strategy to represent particles within the population, thereby providing swift convergence towards the best architecture. We evaluate our proposed algorithm against several recent algorithms in the literature by using nine benchmark datasets for classification tasks and comparing it to 27 other algorithms, including state-of-the-art ones. Our experimental results show that our proposed method, pswvCNN, is able to quickly find effective CNN architectures that provide comparable performance to the best currently available designs, indicating its significant potential.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available