4.6 Article

DFANet: Denoising Frequency Attention Network for Building Footprint Extraction in Very-High-Resolution Remote Sensing Images

Journal

ELECTRONICS
Volume 12, Issue 22, Pages -

Publisher

MDPI
DOI: 10.3390/electronics12224592

Keywords

computational intelligence; neural networks; building footprint extraction; attention mechanism; remote-sensing images

Ask authors/readers for more resources

With the development of VHR remote-sensing technology, automatic identification and extraction of building footprints play a significant role in urban development tracking. However, VHR technology, while characterizing building details accurately, also enhances background interference and noise, degrading the fine-grained detection of building footprints. To address these issues, this study proposes a denoising frequency attention network (DFANet) for extracting building footprints in VHR images, incorporating a denoising frequency attention module and a pyramid pooling module into the network architecture. Experimental results demonstrate the effectiveness and superiority of the proposed method, emphasizing the critical role it plays.
With the rapid development of very-high-resolution (VHR) remote-sensing technology, automatic identification and extraction of building footprints are significant for tracking urban development and evolution. Nevertheless, while VHR can more accurately characterize the details of buildings, it also inevitably enhances the background interference and noise information, which degrades the fine-grained detection of building footprints. In order to tackle the above issues, the attention mechanism is intensively exploited to provide a feasible solution. The attention mechanism is a computational intelligence technique inspired by the biological vision system capable of rapidly and automatically catching critical information. On the basis of the a priori frequency difference of different ground objects, we propose the denoising frequency attention network (DFANet) for building footprint extraction in VHR images. Specifically, we design the denoising frequency attention module and pyramid pooling module, which are embedded into the encoder-decoder network architecture. The denoising frequency attention module enables efficient filtering of high-frequency noises in the feature maps and enhancement of the frequency information related to buildings. In addition, the pyramid pooling module is leveraged to strengthen the adaptability and robustness of buildings at different scales. Experimental results of two commonly used real datasets demonstrate the effectiveness and superiority of the proposed method; the visualization and analysis also prove the critical role of the proposal.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available