4.6 Article

Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 53, Issue 1, Pages 526-538

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2022.3162945

Keywords

Decoding; Optical imaging; Optical sensors; Optical fiber networks; Optical fibers; Object detection; Feature extraction; Adjacent context coordination; bifurcation-aggregation block (BAB); optical remote sensing images (RSIs); salient object detection (SOD)

Ask authors/readers for more resources

In this article, a novel adjacent context coordination network (ACCoNet) is proposed for salient object detection (SOD) in optical remote sensing images (RSIs). ACCoNet improves the performance of SOD by exploring the coordination of adjacent features in an encoder-decoder architecture and introduces local and adjacent branches to handle multilevel features. Additionally, a bifurcation-aggregation block (BAB) is introduced to capture contextual information by extending the capabilities of the classic decoder block.
Salient object detection (SOD) in optical remote sensing images (RSIs), or RSI-SOD, is an emerging topic in understanding optical RSIs. However, due to the difference between optical RSIs and natural scene images (NSIs), directly applying NSI-SOD methods to optical RSIs fails to achieve satisfactory results. In this article, we propose a novel adjacent context coordination network (ACCoNet) to explore the coordination of adjacent features in an encoder-decoder architecture for RSI-SOD. Specifically, ACCoNet consists of three parts: 1) an encoder; 2) adjacent context coordination modules (ACCoMs); and 3) a decoder. As the key component of ACCoNet, ACCoM activates the salient regions of output features of the encoder and transmits them to the decoder. ACCoM contains a local branch and two adjacent branches to coordinate the multilevel features simultaneously. The local branch highlights the salient regions in an adaptive way, while the adjacent branches introduce global information of adjacent levels to enhance salient regions. In addition, to extend the capabilities of the classic decoder block (i.e., several cascaded convolutional layers), we extend it with two bifurcations and propose a bifurcation-aggregation block (BAB) to capture the contextual information in the decoder. Extensive experiments on two benchmark datasets demonstrate that the proposed ACCoNet outperforms 22 state-of-the-art methods under nine evaluation metrics, and runs up to 81 fps on a single NVIDIA Titan X GPU. The code and results of our method are available at https://github.com/MathLee/ACCoNet.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available