☆ 4.6 Article

SCMA: Exploring Dual-Module Attention With Multi-Scale Kernels for Effective Feature Extraction

IEEE ACCESS (2023)

Journal

IEEE ACCESS

Volume 11, Issue -, Pages 132088-132100

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/ACCESS.2023.3329581

Keywords

Channel attention; convolutional neural network; dual-module attention; feature extraction; spatial attention

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Feature space enrichment is crucial for the development of attention mechanisms in CNNs. The research presents SCMA, an attention mechanism that combines channel and spatial attention to extract features efficiently while balancing parameter efficiency and accuracy.

Feature space enrichment is an integral part of the development of attention mechanisms in Convolutional Neural Networks (CNNs). The ability to efficiently extract channel and spatial information across a variety of scales is crucial. Furthermore, balancing model parameter efficiency while ensuring higher accuracy is a key objective. To create a compelling and robust attention mechanism, channel and spatial attention must be carefully incorporated into CNN architecture. This research work addresses these challenges and presents an attention mechanism called Spatial and Channel aware Multi-scale kernel Attention (SCMA) for CNNs. Our approach leverages the combination of two separate attention modules, one for channel-wise attention and another for spatial attention, in sequential order to refine intermediate feature representations in a CNN. The SCMA module is designed to be compact and universal, capable of being seamlessly integrated into any baseline CNN architecture with minimal parameter overhead, and can be trained in an end-to-end manner. Our empirical findings regarding the utilization of SCMA in conjunction with various CNN architectures for image classification tasks on multiple benchmark datasets including Imagenette, Imagewoof, CIFAR-10, CIFAR-100, and CINIC, affirm the intuition that multi-scale kernels are pivotal for effectively capturing dependencies across both spatial and channel dimensions. In many instances, SCMA exhibits higher performance in terms of accuracy than its state-of-the-art counterparts while keeping the parameter overhead to a minimum.

SCMA: Exploring Dual-Module Attention With Multi-Scale Kernels for Effective Feature Extraction

Journal

IEEE ACCESS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

SCMA: Exploring Dual-Module Attention With Multi-Scale Kernels for Effective Feature Extraction

Journal

IEEE ACCESS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper