4.6 Article

Few-shot learning-based RGB-D salient object detection: A case study

Journal

NEUROCOMPUTING
Volume 512, Issue -, Pages 142-152

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2022.09.019

Keywords

RGB-D salient object detection; Saliency detection; Few-shot learning; Depth map; Multi-modal detection

Funding

  1. NSFC [62176169, 62006162]
  2. Chengdu Key Research and Development Support Program [2019-YF09-00129-GX]
  3. SCU-Luzhou Municipal Peoples Government Strategic Cooperation Project [2020CDLZ-10, 2021CDLZ-13]

Ask authors/readers for more resources

RGB-D salient object detection is an important research task, but existing models often suffer from poor generalizability. This paper treats RGB-D salient object detection as a few-shot learning problem and introduces prior knowledge from a closely related task, RGB salient object detection, to enhance performance. Experimental results validate the feasibility of using few-shot learning techniques to improve RGB-D salient object detection.
RGB-D salient object detection (SOD) aims at detecting general attention-grabbing objects from paired RGB and depth image inputs, and recently has attracted increasing research attention. Despite that many advanced RGB-D SOD models are proposed, almost all of them focus on developing models in a fully supervised manner with a small training dataset that typically has only hundreds of RGB-D samples. This may inevitably incur poor generalizability of these models when being applied to real-world scenar-ios and applications. To narrow such a gap, we make the first attempt of treating RGB-D SOD as a few -shot learning (FSL) problem, and improve it by introducing extra prior knowledge from a closely related task, i.e., RGB SOD. Inspired by the general taxonomy of FSL techniques, we investigate from two perspec-tives, namely model and data, of transferring additional knowledge from the RGB SOD dataset to enhance RGB-D SOD performance. For the former, we employ multi-task learning with parameter sharing to con-strain the model space, whereas for the latter, we propose to generate the depth from RGB by using an off-the-shelf depth estimator. Representative middle-fusion and late-fusion models are trialed and vali-dated under such a FSL setup. Our experimental results and analyses confirm the feasibility of promoting RGB-D SOD via FSL techniques, while comparative study on different FSL techniques and detection strate-gies is conducted. We hope this work can serve as a catalyst for bringing RGB-D saliency detection into real applications, as well as for inspiring future works that apply few-shot learning to saliency detection and other multi-modal detection tasks.(c) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available