4.7 Article

Hybrid Multiple Attention Network for Semantic Segmentation in Aerial Images

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2021.3065112

关键词

Semantics; Image segmentation; Correlation; Feature extraction; Task analysis; Remote sensing; Graphics processing units; Aerial imagery; deep convolution neural networks (DCNNs); self-attention (SA) mechanism; semantic segmentation

资金

  1. National Natural Science Foundation of China [61725105, 41701508]

向作者/读者索取更多资源

This research proposes a novel attention-based framework named hybrid multiple attention network (HMANet) for semantic segmentation in remote sensing images. The HMANet adaptively captures global correlations by introducing class augmented attention and region shuffle attention modules, improving the efficiency and effectiveness of the self-attention mechanism.
Semantic segmentation in very-high-resolution (VHR) aerial images is one of the most challenging tasks in remote sensing image understanding. Most of the current approaches are based on deep convolutional neural networks (DCNNs). However, standard convolution with local receptive fields fails in modeling global dependencies. Prior research works have indicated that attention-based methods can capture long-range dependencies and further reconstruct the feature maps for better representation. Nevertheless, limited by the mere perspective of spatial and channel attention and huge computation complexity of self-attention (SA) mechanism, it is unlikely to model the effective semantic interdependencies between each pixel pair of remote sensing data with complex spectra. In this work, we propose a novel attention-based framework named hybrid multiple attention network (HMANet) to adaptively capture global correlations from the perspective of space, channel, and category in a more effective and efficient manner. Concretely, a class augmented attention (CAA) module embedded with a class channel attention (CCA) module can be used to compute category-based correlation and recalibrate the class-level information. In addition, we introduce a simple yet effective region shuffle attention (RSA) module to reduce feature redundant and improve the efficiency of SA mechanism via regionwise representations. Extensive experimental results on the ISPRS Vaihingen, Potsdam benchmark, and iSAID data set demonstrate the effectiveness and efficiency of our HMANet over other state-of-the-art methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据