4.7 Article

Weakly-supervised semantic segmentation with saliency and incremental supervision updating

Journal

PATTERN RECOGNITION
Volume 115, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2021.107858

Keywords

Weakly-supervised; Semantic segmentation; Convolution neural networks

Funding

  1. National Natural Science Foundation of China [61772568, U1911401]
  2. Guangdong Basic and Applied Basic Research Foundation [2019A1515012029]
  3. Guangzhou Science and Technology Program [201804010288]
  4. Fundamental Research Funds for the Central Universities [18lgzd15]
  5. Guangdong Special Support Program

Ask authors/readers for more resources

The LSISU model utilizes saliency and incremental supervision updating to dynamically update segmentation supervision, complementing classification loss with an image saliency objective. By generating high-quality initial mask estimation through class-wise pooling strategy, LSISU effectively deals with object co-occurrence problem and achieves superior segmentation performance on benchmark datasets.
Weakly-supervised semantic segmentation aims at tackling the dense labeling task using weak supervision so as to reduce human annotation efforts. For weakly-supervised semantic segmentation using only image-level annotation, we propose a novel model of Learning with Saliency and Incremental Supervision Updating (LSISU), in which both the guidances of saliency prior and class information are jointly used and the segmentation supervision is dynamically updated. In the proposed LSISU, we present an image saliency objective complementary to classification loss, by which the trained weakly-supervised deep network can effectively deal with object co-occurrence problem. Meanwhile, we make full use of the class-wise pooling strategy to generate initial mask estimation of high quality. Given an initial annotation, a segmentation network is learned along with incremental supervision updating, which plays a role of region expansion and corrects the falsely estimated supervision for training images. The incremental supervision updating is performed on the fly and involves repeated usage of a fully connected conditional random field algorithm. LSISU achieves superior segmentation performance in terms of mIoU metric on benchmark datasets, which are 62.5% on the PASCAL VOC 2012 test set and 30.1% on the COCO val set. (c) 2021 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available