☆ 4.6 Article

A position-aware attention network with progressive detailing for land use semantic segmentation of Remote Sensing images

INTERNATIONAL JOURNAL OF REMOTE SENSING (2023)

Journal

INTERNATIONAL JOURNAL OF REMOTE SENSING

Volume 44, Issue 21, Pages 6762-6801

Publisher

TAYLOR & FRANCIS LTD

DOI: 10.1080/01431161.2023.2274820

Keywords

Semantic Segmentation; Remote Sensing Images; Attention mechanism; Land-use Classification

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This article introduces a specific attention-based network called PaANet for semantic segmentation of remote sensing images. By incorporating position-aware attention and pyramid pooling expectation-maximization modules, this method significantly improves recognition accuracy and the continuity of ground object recognition while preserving structural classification details. The research also proposes a multiresolution data augmentation method that further enhances the model's performance and generalization ability.

Deep learning has achieved remarkable success in the semantic segmentation of remote sensing images (RSIs).In the domain of semantic segmentation, where classification and localization tasks need to be performed simultaneously, it is crucial to consider both global and local spatial relationships in RSIs. This is especially important for the recognition of ground objects that have a slim and elongated appearance. However, existing methods for land use semantic segmentation lack an effective mechanism to coordinate and address these two aspects, resulting in limitations on the recognition of slim targets and the continuity of land object identification. Here, a specific attention-based network called PaANet is developed for semantic segmentation. Our proposed framework builds upon the Swin transformer by incorporating two key modules: the position-aware attention (PaA) module and the pyramid pooling expectation-maximization (PPEM) module. These modules provide significant improvements in recognition accuracy and the continuity of ground object recognition while preserving structural classification details. Furthermore, we propose a multiresolution data augmentation method that utilizes scale-related information to guide the encoder. This approach leads to improved performance and generalization ability for the model. In experiments, the mIoU of our approach for the BLU and GID datasets is 2.37% and 3.94% higher than that of the baseline model, respectively. Our results also show significant superior to those of other methods regarding the continuity of ground object recognition.

A position-aware attention network with progressive detailing for land use semantic segmentation of Remote Sensing images

Journal

INTERNATIONAL JOURNAL OF REMOTE SENSING

Publisher

TAYLOR & FRANCIS LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A position-aware attention network with progressive detailing for land use semantic segmentation of Remote Sensing images

Journal

INTERNATIONAL JOURNAL OF REMOTE SENSING

Publisher

TAYLOR & FRANCIS LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper