4.7 Article

Distinguishing foreground and background alignment for unsupervised domain adaptative semantic segmentation

Journal

IMAGE AND VISION COMPUTING
Volume 124, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.imavis.2022.104513

Keywords

Semantic segmentation; Self -supervised learning; pseudo labels; Attention mechanism; Focal loss

Funding

  1. National Natural Science Foundation of China [61966004, 61866004]
  2. Guangxi Natural Science Foundation [2019GXNSFDA245018]
  3. Guangxi Bagui Scholar Teams for Innovation and Research Project
  4. Guangxi Talent Highland Project of Big Data Intelligence and Application, Guangxi Collaborative Innovation Center of Multi -source Information Integration and Intelligent Processing

Ask authors/readers for more resources

Unsupervised domain adaptive semantic segmentation utilizes knowledge learned from labeled source domain dataset to guide segmentation in the target domain. Pseudo labels are generated using a self-supervised learning method to align corresponding pixels with the source domain based on segmentation loss. A channel and spatial parallel attention module is employed to extract rich spatial and channel information from the feature map, and focal loss is introduced to address class imbalance in the dataset.
Unsupervised domain adaptive semantic segmentation uses the knowledge learned from the labeled source domain dataset to guide the segmentation of the target domain. However, this domain migration method will cause a large inter-domain difference due to the different feature distributions between the source domain and the target domain. We use the self-supervised learning method to generate pseudo labels for the target domain, so that the corresponding pixels are directly aligned with the source domain according to the segmentation loss. Through observation, it is found that the spatial distribution of the background class in the source domain and the target domain has a small difference, and the appearance of the same class of the foreground class will also be quite different. We use the method of distinguishing alignment between foreground and background classes. We understand that acquiring the rich space and channel information on the feature map during the convolution process is essential for fine-grained semantic segmentation. Therefore, in order to obtain the dependency relationship between the channels of the feature map and the spatial position information, we use a channel and spatial parallel attention module. This module enables the network to select and amplify valuable space and channel information from the global information and suppress useless information. In addition, we introduce focal loss to solve the problem of class imbalance in the data set. Experiments show that our method achieves better segmentation performance in unsupervised domain adaptive semantic segmentation. (c) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available