4.7 Article

Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCSVT.2021.3102268

Keywords

Feature extraction; Task analysis; Fuses; Object detection; Image segmentation; Semantics; Lighting; Salient object detection; RGB-T; multi-modality; information fusion

Funding

  1. Natural Science Foundation of Chongqing, China [cstc2020jcyj-msxmX0825]
  2. Fundamental Research Funds for the Central Universities [2020CDJ-LHZZ-078]

Ask authors/readers for more resources

The study proposed an efficient encoder-decoder network named Context-guided Stacked Refinement Network (CSRNet), which reduces computational cost using a lightweight backbone and efficient decoder parts, fuses RGB and T modalities through the Context-guided Cross Modality Fusion (CCMF) module, and refines features progressively via the Stacked Refinement Network (SRN).
RGB-T salient object detection (SOD) aims at utilizing the complementary cues of RGB and Thermal (T) modalities to detect and segment the common objects. However, on one hand, existing methods simply fuse the features of two modalities without fully considering the characters of RGB and T. On the other hand, the high computational cost of existing methods prevents them from real-world applications (e.g., automatic driving, abnormal detection, person re-ID). To this end, we proposed an efficient encoder-decoder network named Context-guided Stacked Refinement Network (CSRNet). Specifically, we utilize a lightweight backbone and design efficient decoder parts, which greatly reduce the computational cost. To fuse RGB and T modalities, we proposed an efficient Context-guided Cross Modality Fusion (CCMF) module to filter the noise and explore the complementation of two modalities. Besides, Stacked Refinement Network (SRN) progressively refines the features from top to down via the interaction of semantic and spatial information. Extensive experiments show that our method performs favorably against state-of-the-art algorithms on RGB-T SOD task while with small model size (4.6M), few FLOPs (4.2G), and real-time speed (38 fps). Our codes is available at: https://github.com/huofushuo/CSRNet.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available