3.8 Proceedings Paper

Salient Object Ranking with Position-Preserved Attention

Publisher

IEEE
DOI: 10.1109/ICCV48922.2021.01602

Keywords

-

Funding

  1. National Key Research and Development Program of China [2018AAA0101400]
  2. National Nature Science Foundation of China [62036009, U1909203, 61936006]
  3. Innovation Capability Support Program of Shaanxi [2021TD-05]
  4. Alibaba-Zhejiang University Joint Institute of Frontier Technologies

Ask authors/readers for more resources

This paper focuses on relative saliency in instance segmentation, proposing a novel SOR task framework and introducing a customized PPA module for the SOR task. Experimental results demonstrate the superior performance of the proposed method.
Instance segmentation can detect where the objects are in an image, but hard to understand the relationship between them. We pay attention to a typical relationship, relative saliency. A closely related task, salient object detection, predicts a binary map highlighting a visually salient region while hard to distinguish multiple objects. Directly combining two tasks by post-processing also leads to poor performance. There is a lack of research on relative saliency at present, limiting the practical applications such as content-aware image cropping, video summary, and image labeling. In this paper, we study the Salient Object Ranking (SOR) task, which manages to assign a ranking order of each detected object according to its visual saliency. We propose the first end-to-end framework of the SOR task and solve it in a multi-task learning fashion. The framework handles instance segmentation and salient object ranking simultaneously. In this framework, the SOR branch is independent and flexible to cooperate with different detection methods, so that easy to use as a plugin. We also introduce a Position-Preserved Attention (PPA) module tailored for the SOR branch. It consists of the position embedding stage and feature interaction stage. Considering the importance of position in saliency comparison, we preserve absolute coordinates of objects in ROI pooling operation and then fuse positional information with semantic features in the first stage. In the feature interaction stage, we apply the attention mechanism to obtain proposals' contextualized representations to predict their relative ranking orders. Extensive experiments have been conducted on the ASR dataset. Without bells and whistles, our proposed method outperforms the former state-of-the-art method significantly. The code will be released publicly available on https://github.com/EricFH/SOR.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available