4.5 Article

MAE-BG: dual-stream boundary optimization for remote sensing image semantic segmentation

期刊

GEOCARTO INTERNATIONAL
卷 38, 期 1, 页码 -

出版社

TAYLOR & FRANCIS LTD
DOI: 10.1080/10106049.2023.2190622

关键词

Deep learning; Boundary optimization; Squeeze and excitation; Semantic segmentation; Remote sensing

向作者/读者索取更多资源

In this study, a dual-stream network MAE-BG was proposed, consisting of an edge detection (ED) branch and a smooth branch with boundary guidance (BG). The ED branch enhances weak edges and suppresses false responses caused by local texture, while the MAE networks extract multiscale edge information to complement detail loss. The segmentation results with improved boundaries are obtained by stacking the output of the ED and smooth branches. The proposed method achieves precise object boundary location and improved segmentation performance.
Deep learning has achieved remarkable performance in semantically segmenting remotely sensed images. However, the high-frequency detail loss caused by continuous convolution and pooling operations and the uncertainty introduced when annotating low-contrast objects with weak boundaries induce blurred object boundaries. Therefore, a dual-stream network MAE-BG, consisting of an edge detection (ED) branch and a smooth branch with boundary guidance (BG), is proposed. The ED branch is designed to enhance the weak edges that need to be preserved, simultaneously suppressing false responses caused by local texture. This mechanism is achieved by introducing improved multiple-attention edge detection blocks (MAE). Furthermore, two specific ED branches with MAE are designed to combine with typical deep convolutional (DC) and Codec infrastructures and result in two configurations of MAE-A and MAE-B. Meanwhile, multiscale edge information extracted by MAE networks is fed into the backbone networks to complement the detail loss caused by convolution and pooling operations. This results in smooth networks with BG. After that, the segmentation results with improved boundaries are obtained by stacking the output of the ED and smooth branches. The proposed algorithms were evaluated on the ISPRS Potsdam and Inria Aerial Image Labelling datasets. Comprehensive experiments show that the proposed method can precisely locate object boundaries and improve segmentation performance. The MAE-A branch leads to an overall accuracy (OA) of 89.16%, a mean intersection over union (MIOU) of 80.25% for Potsdam, and an OA of 96.61% and MIOU of 86.63% for Inria. Compared with the results without the proposed edge optimization blocks, the OAs from the Potsdam and Inria datasets increase by 5.49% and 7.64%, respectively.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据