☆ 4.6 Article

Few-shot learning-based RGB-D salient object detection: A case study

NEUROCOMPUTING (2022)

期刊

NEUROCOMPUTING

卷 512, 期 -, 页码 142-152

出版社

ELSEVIER

DOI: 10.1016/j.neucom.2022.09.019

关键词

RGB-D salient object detection; Saliency detection; Few-shot learning; Depth map; Multi-modal detection

类别

Computer Science, Artificial Intelligence

资金

NSFC [62176169, 62006162]
Chengdu Key Research and Development Support Program [2019-YF09-00129-GX]
SCU-Luzhou Municipal Peoples Government Strategic Cooperation Project [2020CDLZ-10, 2021CDLZ-13]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

RGB-D salient object detection is an important research task, but existing models often suffer from poor generalizability. This paper treats RGB-D salient object detection as a few-shot learning problem and introduces prior knowledge from a closely related task, RGB salient object detection, to enhance performance. Experimental results validate the feasibility of using few-shot learning techniques to improve RGB-D salient object detection.

RGB-D salient object detection (SOD) aims at detecting general attention-grabbing objects from paired RGB and depth image inputs, and recently has attracted increasing research attention. Despite that many advanced RGB-D SOD models are proposed, almost all of them focus on developing models in a fully supervised manner with a small training dataset that typically has only hundreds of RGB-D samples. This may inevitably incur poor generalizability of these models when being applied to real-world scenar-ios and applications. To narrow such a gap, we make the first attempt of treating RGB-D SOD as a few -shot learning (FSL) problem, and improve it by introducing extra prior knowledge from a closely related task, i.e., RGB SOD. Inspired by the general taxonomy of FSL techniques, we investigate from two perspec-tives, namely model and data, of transferring additional knowledge from the RGB SOD dataset to enhance RGB-D SOD performance. For the former, we employ multi-task learning with parameter sharing to con-strain the model space, whereas for the latter, we propose to generate the depth from RGB by using an off-the-shelf depth estimator. Representative middle-fusion and late-fusion models are trialed and vali-dated under such a FSL setup. Our experimental results and analyses confirm the feasibility of promoting RGB-D SOD via FSL techniques, while comparative study on different FSL techniques and detection strate-gies is conducted. We hope this work can serve as a catalyst for bringing RGB-D saliency detection into real applications, as well as for inspiring future works that apply few-shot learning to saliency detection and other multi-modal detection tasks.(c) 2022 Elsevier B.V. All rights reserved.

Few-shot learning-based RGB-D salient object detection: A case study

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Few-shot learning-based RGB-D salient object detection: A case study

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文