4.3 Article

Multiscale attention dynamic aware network for fine-grained visual categorization

期刊

ELECTRONICS LETTERS
卷 59, 期 1, 页码 -

出版社

WILEY
DOI: 10.1049/ell2.12696

关键词

data mining; image classification; image recognition

向作者/读者索取更多资源

In this paper, the authors propose a novel neural network, MADA-Net, for fine-grained visual categorization. It addresses the challenges of inter-class similarities and scale variation through multiscale attention mechanisms and a dynamic aware module. A multiscale adjusted loss is also introduced to improve the network performance.
Fine-grained visual categorization (FGVC) is a challenging task, facing the issues such as inter-class similarities, large intra-class variances, scale variation, and angle variation. To address these issues, the authors propose a novel multiscale attention dynamic aware network (MADA-Net). The core of network consists of three parallel sub-networks, which learn features from different scales. Each sub-network is composed of three serial sub-modules: (1) A self-attention module (SAM) locates objects according to relative importance scattered throughout feature map. (2) A multiscale feature extractor (MFE) learns the non-linear features of objects. (3) A dynamic aware module (DAM) enhances the learning capability of spatial deformation of the network to generate high-quality feature map. In addition, the authors propose a multiscale adjusted loss (MA-Loss) to improve the performance of network. Experiments on three prevailing benchmark datasets demonstrate that our method can achieve state-of-the-art performance.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据