Article

STA-Net: spatial-temporal attention network for video salient object detection

Journal

APPLIED INTELLIGENCE
Volume 51, Issue 6, Pages 3450-3459

Publisher

SPRINGER
DOI: 10.1007/s10489-020-01961-4

Keywords

Multi-scale; Video salient object detection; Attention; Pyramid

This paper systematically studies the role of spatial and temporal attention mechanisms in video salient object detection and proposes a two-stage spatial-temporal attention network called STA-Net. Using its Multi-Scale-Spatial-Attention and Pyramid-Saliency-Shift-Aware modules, the network efficiently exploits multi-scale saliency information and dynamic object information, achieving compelling performance on the video salient object detection task.
This paper conducts a systematic study of the role of spatial and temporal attention mechanisms in the video salient object detection (VSOD) task. We present a two-stage spatial-temporal attention network, named STA-Net, which makes two major contributions. In the first stage, we devise a Multi-Scale-Spatial-Attention (MSSA) module that reduces computation on non-salient regions while exploiting multi-scale saliency information. This sliced attention method offers a distinct way to efficiently exploit the network's high-level features with an enlarged receptive field. In the second stage, we propose a Pyramid-Saliency-Shift-Aware (PSSA) module, which emphasizes dynamic object information, since it offers a valid shift cue for confirming the salient object and capturing temporal information. This temporal detection module encourages precise salient region detection. Extensive experiments show that the proposed STA-Net is effective for the video salient object detection task and achieves compelling performance in comparison with state-of-the-art methods.
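
The abstract does not give the internals of the MSSA module, so the following is only a generic illustration of the idea of multi-scale spatial attention, not the authors' method. It assumes a single-channel feature map stored as a 2D list, uses fixed average pooling at several window sizes in place of learned convolutions, and averages the resulting sigmoid maps into one attention map that gates the features. The names `avg_pool`, `multi_scale_spatial_attention`, and the `scales` parameter are hypothetical.

```python
import math


def avg_pool(fmap, k):
    """Average-pool a 2D list with a k x k window (stride 1, clipped at edges)."""
    h, w = len(fmap), len(fmap[0])
    r = k // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            rows = range(max(0, i - r), min(h, i + r + 1))
            cols = range(max(0, j - r), min(w, j + r + 1))
            vals = [fmap[y][x] for y in rows for x in cols]
            out[i][j] = sum(vals) / len(vals)
    return out


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def multi_scale_spatial_attention(fmap, scales=(1, 3, 5)):
    """Return (attention map, gated features).

    Assumption: fixed pooling stands in for learned layers; a real module
    would learn how to produce and combine the per-scale maps.
    """
    h, w = len(fmap), len(fmap[0])
    pooled = [avg_pool(fmap, k) for k in scales]
    # Mean of the per-scale sigmoid responses gives one weight per location.
    attn = [
        [sum(sigmoid(p[i][j]) for p in pooled) / len(pooled) for j in range(w)]
        for i in range(h)
    ]
    # Gating: locations with low attention are suppressed.
    gated = [[fmap[i][j] * attn[i][j] for j in range(w)] for i in range(h)]
    return attn, gated


# Toy feature map: a strongly activated 3 x 3 blob on a negative background.
toy = [[4.0 if 1 <= i <= 3 and 1 <= j <= 3 else -4.0 for j in range(5)]
       for i in range(5)]
attn, gated = multi_scale_spatial_attention(toy)
```

On the toy map, the attention weight at the centre of the blob is higher than at the corners, so background responses are attenuated more strongly than responses inside the salient region.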
