☆ 4.7 Article

AMFuse: Add-Multiply-Based Cross-Modal Fusion Network for Multi-Spectral Semantic Segmentation

REMOTE SENSING (2022)

期刊

REMOTE SENSING

卷 14, 期 14, 页码 -

出版社

MDPI

DOI: 10.3390/rs14143368

关键词

multi-spectral images; cross-modal feature fusion network; semantic segmentation; salient object detection

类别

Environmental Sciences Geosciences, Multidisciplinary Remote Sensing Imaging Science & Photographic Technology

资金

National Natural Science Foundation of China [62001063, U20A20157, U2133211]
China Postdoctoral Science Foundation [2020M673135]
Chongqing Postdoctoral Research Program [XmT2020050]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a simple yet effective add-multiply fusion (AMFuse) module for fusing RGB and thermal information. The module extracts cross-modal complementary features and common features through addition and multiplication operations, respectively. Attention and ASPP modules are incorporated to enhance multi-scale context information. Experimental results demonstrate the effectiveness of the proposed AMFuse module in multi-spectral semantic segmentation and salient object detection.

Multi-spectral semantic segmentation has shown great advantages under poor illumination conditions, especially for remote scene understanding of autonomous vehicles, since the thermal image can provide complementary information for RGB image. However, methods to fuse the information from RGB image and thermal image are still under-explored. In this paper, we propose a simple but effective module, add-multiply fusion (AMFuse) for RGB and thermal information fusion, consisting of two simple math operations-addition and multiplication. The addition operation focuses on extracting cross-modal complementary features, while the multiplication operation concentrates on the cross-modal common features. Moreover, the attention module and atrous spatial pyramid pooling (ASPP) modules are also incorporated into our proposed AMFuse modules, to enhance the multi-scale context information. Finally, in the UNet-style encoder-decoder framework, the ResNet model is adopted as the encoder. As for the decoder part, the multi-scale information obtained from our proposed AMFuse modules is hierarchically merged layer-by-layer to restore the feature map resolution for semantic segmentation. The experiments of RGBT multi-spectral semantic segmentation and salient object detection demonstrate the effectiveness of our proposed AMFuse module for fusing the RGB and thermal information.

AMFuse: Add-Multiply-Based Cross-Modal Fusion Network for Multi-Spectral Semantic Segmentation

期刊

REMOTE SENSING

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

AMFuse: Add-Multiply-Based Cross-Modal Fusion Network for Multi-Spectral Semantic Segmentation

期刊

REMOTE SENSING

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文