Article

Object localization and edge refinement network for salient object detection

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 213

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2022.118973

Keywords

Transformer features; Information multiple selection; Object localization and edge refinement; Dual edge labels

This paper proposes the Object Localization and Edge Refinement (OLER) network, a saliency detection method built on a transformer backbone. The network works in two stages: the first localizes salient objects using an Information Multiple Selection (IMS) module, and the second sharpens object boundaries using an edge generation module and an edge refinement module. Experiments on five public datasets show improvements in both detection accuracy and detection efficiency.
Most existing methods feed images into a CNN backbone to obtain image features. However, compared with convolutional features, the recently emerging transformer features can express the meaningful features of an image more accurately. In this paper, we use a transformer backbone to capture multiple feature layers of an image, and design an Object Localization and Edge Refinement (OLER) network for saliency detection. Our network is divided into two stages: the first stage localizes objects, and the second stage refines their boundaries. In the first stage, we directly apply multiple feature layers to identify salient regions, and we design an Information Multiple Selection (IMS) module to capture the saliency cues of each feature layer. The IMS module contains multiple pathways, each of which makes a judgment about the location of saliency information. Processing an input feature layer with the IMS module mines its potential salient object information. The second stage consists of two modules: the edge generation module and the edge refinement module. The edge generation module takes the original image and the saliency map as inputs and outputs two edge maps that focus on different edge ranges. To sharpen the object edges, the original image, the initial saliency map and the two edge maps are fed into the edge refinement module, which outputs the final saliency map. The network as a whole is relatively simple and easy to build, and involves no complex components. Experimental results on five public datasets demonstrate that our method significantly improves detection accuracy while also achieving better detection efficiency. The code is available at https://github.com/CKYiu/OLER.
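The two-stage data flow described in the abstract can be sketched as plain function wiring. This is only an illustrative sketch, not the authors' implementation (see the linked repository for that). The name `run_two_stage`, the `Map` type, the `fuse` step that merges per-layer cues into one initial saliency map, and all `toy_*` stand-ins are hypothetical and exist only so the wiring can be executed.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical stand-in for an image or feature tensor: a single-channel H x W grid.
Map = List[List[float]]


def run_two_stage(
    image: Map,
    backbone: Callable[[Map], Sequence[Map]],
    ims: Callable[[Map], Map],
    fuse: Callable[[Sequence[Map]], Map],
    edge_generation: Callable[[Map, Map], Tuple[Map, Map]],
    edge_refinement: Callable[[Map, Map, Map, Map], Map],
) -> Tuple[Map, Tuple[Map, Map], Map]:
    """Wire the stages in the order the abstract describes."""
    # Stage 1: object localization.
    features = backbone(image)          # multiple feature layers
    cues = [ims(f) for f in features]   # IMS applied to each feature layer
    initial = fuse(cues)                # assumed merge into an initial saliency map

    # Stage 2: edge generation, then edge refinement.
    edges = edge_generation(image, initial)  # two edge maps
    final = edge_refinement(image, initial, edges[0], edges[1])
    return initial, edges, final


# Trivial stand-ins (NOT the paper's modules) that make the sketch runnable.
def toy_backbone(image: Map) -> Sequence[Map]:
    return [image, image]  # pretend the backbone yields two feature layers


def toy_ims(feature: Map) -> Map:
    # Clamp values into [0, 1] as a placeholder for saliency cue extraction.
    return [[min(1.0, max(0.0, v)) for v in row] for row in feature]


def toy_fuse(cues: Sequence[Map]) -> Map:
    # Element-wise mean of the per-layer cues.
    n = len(cues)
    rows, cols = len(cues[0]), len(cues[0][0])
    return [[sum(c[i][j] for c in cues) / n for j in range(cols)] for i in range(rows)]


def toy_edge_generation(image: Map, saliency: Map) -> Tuple[Map, Map]:
    return saliency, saliency  # placeholder for the two edge maps


def toy_edge_refinement(image: Map, saliency: Map, edge_a: Map, edge_b: Map) -> Map:
    return saliency  # placeholder: returns the initial map unchanged
```

Passing the `toy_*` functions to `run_two_stage` exercises the full path from input image to final saliency map. Replacing them with real modules would not change the wiring.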
