4.5 Article

MAE-BG: dual-stream boundary optimization for remote sensing image semantic segmentation

Journal

GEOCARTO INTERNATIONAL
Volume 38, Issue 1, Pages -

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/10106049.2023.2190622

Keywords

Deep learning; Boundary optimization; Squeeze and excitation; Semantic segmentation; Remote sensing

Ask authors/readers for more resources

In this study, a dual-stream network MAE-BG was proposed, consisting of an edge detection (ED) branch and a smooth branch with boundary guidance (BG). The ED branch enhances weak edges and suppresses false responses caused by local texture, while the MAE networks extract multiscale edge information to complement detail loss. The segmentation results with improved boundaries are obtained by stacking the output of the ED and smooth branches. The proposed method achieves precise object boundary location and improved segmentation performance.
Deep learning has achieved remarkable performance in semantically segmenting remotely sensed images. However, the high-frequency detail loss caused by continuous convolution and pooling operations and the uncertainty introduced when annotating low-contrast objects with weak boundaries induce blurred object boundaries. Therefore, a dual-stream network MAE-BG, consisting of an edge detection (ED) branch and a smooth branch with boundary guidance (BG), is proposed. The ED branch is designed to enhance the weak edges that need to be preserved, simultaneously suppressing false responses caused by local texture. This mechanism is achieved by introducing improved multiple-attention edge detection blocks (MAE). Furthermore, two specific ED branches with MAE are designed to combine with typical deep convolutional (DC) and Codec infrastructures and result in two configurations of MAE-A and MAE-B. Meanwhile, multiscale edge information extracted by MAE networks is fed into the backbone networks to complement the detail loss caused by convolution and pooling operations. This results in smooth networks with BG. After that, the segmentation results with improved boundaries are obtained by stacking the output of the ED and smooth branches. The proposed algorithms were evaluated on the ISPRS Potsdam and Inria Aerial Image Labelling datasets. Comprehensive experiments show that the proposed method can precisely locate object boundaries and improve segmentation performance. The MAE-A branch leads to an overall accuracy (OA) of 89.16%, a mean intersection over union (MIOU) of 80.25% for Potsdam, and an OA of 96.61% and MIOU of 86.63% for Inria. Compared with the results without the proposed edge optimization blocks, the OAs from the Potsdam and Inria datasets increase by 5.49% and 7.64%, respectively.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available