4.5 Article

Pooling Attention-based Encoder-Decoder Network for semantic segmentation

Journal

COMPUTERS & ELECTRICAL ENGINEERING
Volume 93, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compeleceng.2021.107260

Keywords

Semantic segmentation; Encoder-Decoder; Pooling attention module; Channel; Position

Funding

  1. NSFC [U19A2083]
  2. Science and Technology Plan Project of Hunan Province, China [2016TP1020]
  3. open fund project of Hunan Provincial Key Laboratory of Intelligent Information Processing and Application for Hengyang normal university, China [IIPA20K04]

Ask authors/readers for more resources

This paper proposes an Encoder-Decoder network for image semantic segmentation using pooling SE-ResNet attention module, called PAEDN, to address the challenge of poor pixel-consistency in inter-category and pixel-similarity in inter-category. Experimental evaluations on PASCAL and Cityscapes datasets show that the proposed method achieves good pixel-consistency semantic label and a 15.1% improvement over FCN.
Aiming to the challenge of poor pixel-consistency in inter-category and pixel-similarity in inter-category, in this paper, we propose an Encoder-Decoder network for image semantic segmentation using pooling SE-ResNet attention module, called PAEDN. It is an effective of attention mechanism to get aggregated information. According to the principle of SE-ResNet, a collection of Average, Maximum and Stochastic global pooling, which concentrate on contoured, detailed, and generalized information in a certain semantic segmentation, form attention modules. Channel Pooling Attention Module (CPAM) and Position Pooling Attention Module (PPAM) are designed and integrated into the Encoder to extract discriminative features from input images, and the Decoder is developed through SE-ResNet attention module to fuse the feature map in high-resolution with that in low-resolution. Experimental evaluations performed on the data sets PASCAL and Cityscapes, show the proposed Encoder-Decoder with pooling attention module produces good pixel-consistency semantic label, achieves 15.1% improvement to FCN.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available