4.7 Article

SSDBN: A Single-Side Dual-Branch Network with Encoder-Decoder for Building Extraction

Journal

REMOTE SENSING
Volume 14, Issue 3, Pages -

Publisher

MDPI
DOI: 10.3390/rs14030768

Keywords

building extraction; dual-branch; semantic segmentation; encoder-decoder network

Funding

  1. Key Laboratory Foundation of National Defence Technology [61424010208]
  2. National Natural Science Foundation of China [62002276, 41911530242, 41975142, 05492018012, 05762018039]
  3. Major Program of the National Social Science Fund of China [17ZDA092]
  4. 333 High-Level Talent Cultivation Project of Jiangsu Province [BRA2018332]
  5. Royal Society of Edinburgh, UK
  6. China Natural Science Foundation Council
  7. basic Research Programs (Natural Science Foundation) of Jiangsu Province [BK20191398, BK20180794]

Ask authors/readers for more resources

In this paper, we propose a new single-side dual-branch network (SSDBN) based on an encoder-decoder structure, which can accurately perform semantic segmentation and improve the capturing ability of semantic details.
In the field of building detection research, an accurate, state-of-the-art semantic segmentation model must be constructed to classify each pixel of the image, which has an important reference value for the statistical work of a building area. Recent research efforts have been devoted to semantic segmentation using deep learning approaches, which can be further divided into two aspects. In this paper, we propose a single-side dual-branch network (SSDBN) based on an encoder-decoder structure, where an improved Res2Net model is used at the encoder stage to extract the basic feature information of prepared images while a dual-branch module is deployed at the decoder stage. An intermediate framework was designed using a new feature information fusion methods to capture more semantic information in a small area. The dual-branch decoding module contains a deconvolution branch and a feature enhancement branch, which are responsible for capturing multi-scale information and enhancing high-level semantic details, respectively. All experiments were conducted using the Massachusetts Buildings Dataset and WHU Satellite Dataset I (global cities). The proposed model showed better performance than other recent approaches, achieving an F1-score of 87.69% and an IoU of 75.83% with a low network size volume (5.11 M), internal parameters (19.8 MB), and GFLOPs (22.54), on the Massachusetts Buildings Dataset.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available