4.7 Article

Fast Convolutional Neural Network with iterative and non-iterative learning

Journal

APPLIED SOFT COMPUTING
Volume 125, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.asoc.2022.109197

Keywords

Convolutional Neural Network; Image classification; Feature extraction; Radial basis function

Ask authors/readers for more resources

This study proposes a novel architecture of convolutional neural network with a non-iterative radial basis function-based classification layer, making it more efficient for image classification tasks. Experimental results show that the proposed architecture outperforms the traditional CNN methods in terms of accuracy, parameter tuning, and time efficiency.
Convolutional Neural Networks (CNNs) have achieved potentially good results for image classification. Due to their learning capabilities such networks are explored and implemented in real world applications. However, when it comes to training a CNN on large datasets without transfer learning, it takes long time and resources, due to the iterative process of weight update in fully connected layer. The backpropagation algorithm used to train the entire network suffers from slow convergence, getting trapped in a local minimum, not guaranteed to find global minimum and being hypersensitive to the learning rate set-up. Therefore, in this paper we propose a novel architecture of convolutional neural network with a non-iterative radial basis function-based classification layer which makes it more efficient for image classification tasks. This is a partially parameter free direct learning approach, which is highly beneficial in real-world applications as well as learning associated with large datasets. The proposed approach has been evaluated on four benchmark datasets such as CIFAR-10, MNIST, Digit and CE-MRI. It has achieved the accuracy scores of 99.8%, 99.8%, 98.4% and 98.3% on MNIST, Digit, CE-MRI and CIFAR-10 datasets respectively, when ResNet18 was used as backbone. Experimental results have illustrated that the proposed architecture is much better as it requires less parameter tuning, does not require handcrafted features and achieves same or higher accuracy in relatively lesser time than the standard CNN. Statistical significance tests were carried out to prove the efficiency of the proposed approach. (C) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available