Proceedings Paper

Weakly Supervised Video Salient Object Detection

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/CVPR46437.2021.01655

Funding

  1. National Science Foundation of China [U1801265]
  2. CSIRO's Machine Learning and Artificial Intelligence Future Science Platform (MLAI FSP)

The study introduces a weakly supervised video salient object detection model based on relabeled, fixation-guided scribble annotations, which reduces the burden of data annotation. It uses an appearance-motion fusion module and a bidirectional ConvLSTM framework for multi-modal learning and long-term temporal context modeling, and adds a novel loss function and a boosting strategy to improve model performance. Experiments on six benchmark video saliency detection datasets verify the effectiveness of the solution.
Fully supervised video salient object detection has improved significantly thanks to pixel-wise labeled training datasets, which are time-consuming and expensive to obtain. To relieve the burden of data annotation, we present the first weakly supervised video salient object detection model based on relabeled, fixation-guided scribble annotations. Specifically, we propose an appearance-motion fusion module and a bidirectional ConvLSTM based framework to achieve effective multi-modal learning and long-term temporal context modeling from our new weak annotations. We also design a novel foreground-background similarity loss to exploit the labeling similarity across frames. Finally, we introduce a weak annotation boosting strategy that improves model performance through a new pseudo-label generation technique. Extensive experimental results on six benchmark video saliency detection datasets illustrate the effectiveness of our solution.
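
To make the idea of learning from scribble annotations concrete, the sketch below computes a binary cross-entropy over scribble-labeled pixels only and skips every unlabeled pixel. This masked ("partial") cross-entropy is a common choice in scribble-supervised segmentation; the abstract does not state that the paper uses it, so treat it as an assumption. The names `partial_bce` and `UNLABELED` are hypothetical, and the foreground-background similarity loss described above is not reproduced here.

```python
import math

UNLABELED = -1  # hypothetical marker for pixels the scribble does not cover


def partial_bce(pred, scribble, eps=1e-7):
    """Mean binary cross-entropy over scribble-labeled pixels only.

    pred: 2D list of predicted foreground probabilities in [0, 1].
    scribble: 2D list holding 1 (foreground), 0 (background), or UNLABELED.
    """
    total, count = 0.0, 0
    for pred_row, label_row in zip(pred, scribble):
        for p, y in zip(pred_row, label_row):
            if y == UNLABELED:
                continue  # unlabeled pixels contribute nothing to the loss
            p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
            total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
            count += 1
    return total / count if count else 0.0


# Toy 2x2 frame: one foreground scribble pixel, one background, two unlabeled.
pred = [[0.9, 0.5], [0.2, 0.5]]
scribble = [[1, UNLABELED], [0, UNLABELED]]
loss = partial_bce(pred, scribble)
```

Because the loss ignores unlabeled pixels, predictions there are unconstrained by this term alone, which is one reason weakly supervised methods typically add further losses or pseudo-labels.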
