4.7 Article

AMFuse: Add-Multiply-Based Cross-Modal Fusion Network for Multi-Spectral Semantic Segmentation

期刊

REMOTE SENSING
卷 14, 期 14, 页码 -

出版社

MDPI
DOI: 10.3390/rs14143368

关键词

multi-spectral images; cross-modal feature fusion network; semantic segmentation; salient object detection

资金

  1. National Natural Science Foundation of China [62001063, U20A20157, U2133211]
  2. China Postdoctoral Science Foundation [2020M673135]
  3. Chongqing Postdoctoral Research Program [XmT2020050]

向作者/读者索取更多资源

This paper proposes a simple yet effective add-multiply fusion (AMFuse) module for fusing RGB and thermal information. The module extracts cross-modal complementary features and common features through addition and multiplication operations, respectively. Attention and ASPP modules are incorporated to enhance multi-scale context information. Experimental results demonstrate the effectiveness of the proposed AMFuse module in multi-spectral semantic segmentation and salient object detection.
Multi-spectral semantic segmentation has shown great advantages under poor illumination conditions, especially for remote scene understanding of autonomous vehicles, since the thermal image can provide complementary information for RGB image. However, methods to fuse the information from RGB image and thermal image are still under-explored. In this paper, we propose a simple but effective module, add-multiply fusion (AMFuse) for RGB and thermal information fusion, consisting of two simple math operations-addition and multiplication. The addition operation focuses on extracting cross-modal complementary features, while the multiplication operation concentrates on the cross-modal common features. Moreover, the attention module and atrous spatial pyramid pooling (ASPP) modules are also incorporated into our proposed AMFuse modules, to enhance the multi-scale context information. Finally, in the UNet-style encoder-decoder framework, the ResNet model is adopted as the encoder. As for the decoder part, the multi-scale information obtained from our proposed AMFuse modules is hierarchically merged layer-by-layer to restore the feature map resolution for semantic segmentation. The experiments of RGBT multi-spectral semantic segmentation and salient object detection demonstrate the effectiveness of our proposed AMFuse module for fusing the RGB and thermal information.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据