4.6 Article

Attention Block Based on Binary Pooling

期刊

APPLIED SCIENCES-BASEL
卷 13, 期 18, 页码 -

出版社

MDPI
DOI: 10.3390/app131810012

关键词

ResNet; attention; image classification

向作者/读者索取更多资源

Image classification is crucial in computer vision with a wide range of applications. This study proposes a novel pooling operation, named Binary Pooling, which combines Global Average Pooling (GAP) and Global Max Pooling (GMP) to extract more comprehensive image features. By applying dilation operations and pointwise convolutions, the extraction of image features is further enhanced. Experimental results show that integrating this attention block into ResNet18/50 models improves accuracy on ImageNet.
Image classification has become highly significant in the field of computer vision due to its wide array of applications. In recent years, Convolutional Neural Networks (CNN) have emerged as potent tools for addressing this task. Attention mechanisms offer an effective approach to enhance the accuracy of image classification. Despite Global Average Pooling (GAP) being a crucial component of traditional attention mechanisms, it only computes the average of spatial elements in each channel, failing to capture the complete range of feature information, resulting in fewer and less expressive features. To address this limitation, we propose a novel pooling operation named Binary Pooling and integrate it into the attention block. Binary pooling combines both GAP and Global Max Pooling (GMP), obtaining a more comprehensive feature vector by extracting average and maximum values, thereby enriching the diversity of extracted image features. Furthermore, to further enhance the extraction of image features, dilation operations and pointwise convolutions are applied on the channel-wise. The proposed attention block is simple yet highly effective. Upon integration into ResNet18/50 models, it leads to accuracy improvements of 2.02%/0.63% on ImageNet.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据