Article

Subtler mixed attention network on fine-grained image classification

APPLIED INTELLIGENCE (2021)

Journal

APPLIED INTELLIGENCE

Volume 51, Issue 11, Pages 7903-7916

Publisher

SPRINGER

DOI: 10.1007/s10489-021-02280-y

Keywords

Fine-grained image recognition; Attention mechanism

Category

Computer Science, Artificial Intelligence

Funding

National Natural Science Foundation of China [61872326, 61672475]
Shandong Provincial Natural Science Foundation [ZR2019MF044]

Smart Summary

The study introduces SMA-Net, which includes one module for locating discriminative regions and one for extracting features from subtler, differentiated regions. The modules are lightweight enough to integrate into advanced networks while improving accuracy on fine-grained image categorization tasks.

Abstract

The key to fine-grained image categorization is locating discriminative regions and extracting from them features that correspond to subtle visual traits. Some current methods use an attention mechanism to identify the discriminative regions but ignore that these regions still contain a large amount of non-foreground noise. In this work, we propose a Subtler Mixed Attention Network (SMA-Net), which contains two modules: 1) A discriminative region location module, which uses the channel attention mechanism to construct a feature pyramid network that locates the discriminative regions. It uses the positive effect of classification to screen a group of the most discriminative regions, and it is trained with learning to rank. 2) A mixed attention module (MAM) for feature extraction, which can focus on subtler and differentiated regions. We divide the feature map into intervals according to regions and learn attention features according to regional orientation. The attention maps are then multiplied with the input feature map for adaptive feature reinforcement. MAM is also a lightweight module that can be easily integrated into advanced networks without adding much computation. We validated SMA-Net through extensive experiments on Caltech-UCSD Birds (CUB-200-2011), Stanford Cars, CIFAR-10, Fish4Knowledge and Flower17. In particular, the accuracy on two widely used fine-grained datasets, CUB-200-2011 and Stanford Cars, reached 87.71% and 94.37%, respectively.
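The abstract does not give the layer-level design of MAM, so the following is only a rough sketch of the general idea it describes: split a feature map into regions, learn an attention vector per region, and multiply it back into the input. It assumes a squeeze-and-excitation style channel gate applied independently to each cell of a regular grid; the function names (`channel_gate`, `regional_attention`), the grid split, the random weights, and the reduction ratio are all hypothetical and are not taken from the paper.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_gate(region, w1, w2):
    """SE-style channel attention for one region of shape (C, h, w).

    Assumption: a two-layer bottleneck on the region's average-pooled
    channel descriptor. The paper may use a different design.
    """
    squeezed = region.mean(axis=(1, 2))        # (C,) per-channel average
    hidden = np.maximum(w1 @ squeezed, 0.0)    # (C // r,) ReLU bottleneck
    return sigmoid(w2 @ hidden)                # (C,) gates in (0, 1)

def regional_attention(feature_map, grid, w1, w2):
    """Split a (C, H, W) map into grid x grid regions, compute a channel
    gate per region, and multiply each gate back into its region."""
    _, height, width = feature_map.shape
    out = np.empty_like(feature_map)
    row_edges = np.linspace(0, height, grid + 1).astype(int)
    col_edges = np.linspace(0, width, grid + 1).astype(int)
    for i in range(grid):
        for j in range(grid):
            rows = slice(row_edges[i], row_edges[i + 1])
            cols = slice(col_edges[j], col_edges[j + 1])
            region = feature_map[:, rows, cols]
            gate = channel_gate(region, w1, w2)
            # Broadcast the (C,) gate over the region's spatial extent.
            out[:, rows, cols] = region * gate[:, None, None]
    return out

# Hypothetical sizes and random (untrained) weights, for illustration only.
rng = np.random.default_rng(0)
channels, reduction = 8, 4
w1 = rng.standard_normal((channels // reduction, channels))
w2 = rng.standard_normal((channels, channels // reduction))
features = rng.standard_normal((channels, 14, 14))

reweighted = regional_attention(features, grid=2, w1=w1, w2=w2)
print(reweighted.shape)
```

Because each gate lies strictly between 0 and 1, the output keeps the input's shape and sign while scaling every channel of every region separately. In a trained network the weights would be learned, so that different regions emphasize different channels.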
