4.3 Article

Multi-source pseudo-label learning of semantic segmentation for the scene recognition of agricultural mobile robots

Journal

ADVANCED ROBOTICS
Volume 36, Issue 19, Pages 1011-1029

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/01691864.2022.2109427

Keywords

Semantic segmentation; deep learning; scene recognition; unsupervised domain adaptation (UDA)

Categories

Funding

  1. Knowledge Hub Aichi Priority Research Project
  2. Leading Graduate School Program, 'Innovative program for training brainscience-information-architects by analysis of massive quantities of highly technical information about the brain,' by the Ministry of Education, Culture, Sports, Science and Technology,

Ask authors/readers for more resources

This paper describes a novel method of training a semantic segmentation model for scene recognition of agricultural mobile robots using multiple publicly available datasets. The proposed method leverages pre-trained models from each source dataset to generate pseudo-labels for the target images and allows for knowledge transfer from multiple sources. Existing state-of-the-art methods are also introduced to suppress the effect of noise in the pseudo-labels and improve performance.
This paper describes a novel method of training a semantic segmentation model for scene recognition of agricultural mobile robots exploiting multiple publicly available datasets that are different from the target greenhouse environments. Semantic segmentation models require abundant labels given by tedious manual annotation for training. Although unsupervised domain adaptation (UDA) is studied as a workaround for such a problem, existing UDA methods assume a source dataset similar to the target dataset, which is not available for greenhouse scenes. In this paper, we propose a method to train a semantic segmentation model for greenhouse images leveraging multiple publicly available datasets not dedicated to greenhouses. We exploit segmentation models pre-trained on each source dataset to generate pseudo-labels for the target images based on agreement of all the pre-trained models on each pixel. The proposed method allows for effectively transferring the knowledge from multiple sources rather than relying on a single dataset and realizes precise training of semantic segmentation model. We also introduce existing state-of-the-art methods to suppress the effect of noise in the pseudo-labels to further improve the performance. We demonstrate that our proposed method outperforms existing UDA methods and a supervised SVM-based method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available