Article

Multi-Attention-Based Semantic Segmentation Network for Land Cover Remote Sensing Images

ELECTRONICS (2023)

Journal

ELECTRONICS

Volume 12, Issue 6, Pages -

Publisher

MDPI

DOI: 10.3390/electronics12061347

Keywords

remote sensing image; attention mechanism; image segmentation; deep learning; semantic segmentation

Categories

Computer Science, Information Systems; Engineering, Electrical & Electronic; Physics, Applied

AI Summary

This paper proposes a multi-attention-based semantic segmentation network for remote sensing images, addressing the challenges of multiple targets and large feature differences in such images. The model improves fine-grained feature extraction by using a coordinate attention-based residual network in the encoder, replaces the traditional upsampling operator with a content-aware reorganization module in the decoder to enhance the network's information extraction, and introduces a fused attention module for feature map fusion to solve the multi-scale problem. Experimental results show that the proposed model outperforms commonly used benchmark models on both the WHDLD dataset and the self-labeled Lu County dataset.

Abstract

Semantic segmentation is a key technology for remote sensing image analysis and is widely used in land cover classification, natural disaster monitoring, and other fields. Unlike the images in traditional segmentation tasks, remote sensing images contain many kinds of targets, with large feature differences between them. As a result, segmentation is more difficult, and existing models show low accuracy and imprecise edge segmentation when applied to remote sensing images. To address these problems, this paper proposes a multi-attention-based semantic segmentation network for remote sensing images. Specifically, we choose UNet as the baseline model and use a coordinate attention-based residual network in the encoder to improve the backbone network's ability to extract fine-grained features. In the decoder, we replace the traditional upsampling operator with a content-aware reorganization module to improve the network's information extraction capability. In addition, we propose a fused attention module for feature map fusion after upsampling, aiming to solve the multi-scale problem. We evaluate the proposed model on the WHDLD dataset and our self-labeled Lu County dataset, where it achieves an mIOU of 63.27% and 72.83% and an mPA of 74.86% and 84.72%, respectively. Comparison experiments and confusion matrix analysis show that our model outperforms commonly used benchmark models on both datasets.
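The abstract reports mIOU and mPA and mentions confusion matrix analysis, but this page does not give the exact formulas used. The following is a minimal sketch of how these two metrics are commonly derived from a confusion matrix, assuming mPA means the average of per-class pixel accuracy and that classes with no pixels are skipped. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predictions."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou(cm):
    """Average of per-class intersection over union (assumed mIOU definition)."""
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]                           # pixels correctly labeled c
        gt = sum(cm[c])                         # pixels whose true class is c
        pred = sum(cm[r][c] for r in range(n))  # pixels predicted as c
        union = gt + pred - tp
        if union > 0:                           # skip classes absent from both
            ious.append(tp / union)
    return sum(ious) / len(ious)


def mean_pixel_accuracy(cm):
    """Average of per-class pixel accuracy (assumed mPA definition)."""
    accs = []
    for c, row in enumerate(cm):
        gt = sum(row)
        if gt > 0:                              # skip classes with no true pixels
            accs.append(row[c] / gt)
    return sum(accs) / len(accs)


# Toy example: six flattened pixels, three classes (illustrative data only).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
print(cm)
print(round(mean_iou(cm), 4), round(mean_pixel_accuracy(cm), 4))
```

In practice the label maps would be flattened per image and the confusion matrix accumulated over the whole test set before the two averages are taken, so that small images do not dominate the score.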
