4.7 Article

Real-time semantic segmentation with local spatial pixel adjustment

Journal

IMAGE AND VISION COMPUTING
Volume 123, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.imavis.2022.104470

Keywords

Semantic segmentation; Real-time; Local spatial adjustment; Dual-branch decoding

Funding

  1. National Natural Science Founda-tion of Chain [62106214]
  2. Natural Science Founda-tion of Hebei Province [F201920311]

Ask authors/readers for more resources

The proposed LSPANet addresses the challenges faced by semantic segmentation networks in utilizing information and handling small-scale objects. By incorporating dual-branch decoding fusion and spatial pixel cross-correlation blocks, it achieves precise adjustment of each pixel value and faster inference speed.
The research of semantic segmentation networks has achieved a significant breakthrough recently. However, most part of methods have difficulty in utilizing information generated at each stage, which resulting in pixel value dislocation and blurred boundaries for small-scale objects. To overcome these challenges, a local spatial pixel adjustment network (LSPANet) is proposed in this paper, which mainly consists of a dual-branch decoding fusion (DDF) module and a spatial pixel cross-correlation (SPCC) block. Specifically, the DDF module takes the high-level and low-level feature maps with different stages as the input, and gradually eliminates the discrepancy in the information of the feature map to fuse a variety of information extracted in the encoder stage. The SPCC block adopts the horizontal spatial pixel adjustment (HSPA) module and the vertical spatial pixel adjustment (VSPA) module to capture the relationship of each pixel value in the local horizontal and vertical space respectively, and then assign the importance to all values based on this relationship. LSPANet is evaluated on Cityscapes and Camvid datasets. The experimental results show that our network achieves 77.1% mIoU with 2 M parameters on the challenging Cityscapes dataset and the inference speed exceeds 30 FPS in a single GTX 2080 Ti GPU.(c) 2022 Published by Elsevier B.V.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available