4.7 Article

ABNet: Adaptive Balanced Network for Multiscale Object Detection in Remote Sensing Imagery

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2021.3133956

关键词

Feature extraction; Object detection; Remote sensing; Adaptive systems; Task analysis; Proposals; Detectors; Adaptive feature pyramid; context exploitation; local cross-channel attention; multiscale object detection; remote sensing image (RSI)

资金

  1. National Natural Science Foundation of China [U21B2041, U1864204, 61632018, 61825603]

向作者/读者索取更多资源

In this article, we propose an adaptive balanced network (ABNet) to address the challenges of remote sensing object detection. Our approach utilizes an enhanced effective channel attention mechanism (EECA), an adaptive feature pyramid network (AFPN) and a context enhancement module (CEM) to improve feature representation and capture more discriminative features. Experimental results demonstrate the superior performance of our approach.
Benefiting from the development of convolutional neural networks (CNNs), many excellent algorithms for object detection have been presented. Remote sensing object detection (RSOD) is a challenging task mainly due to: 1) complicated background of remote sensing images (RSIs) and 2) extremely imbalanced scale and sparsity distribution of remote sensing objects. Existing methods cannot effectively solve these problems with excellent detection accuracy and rapid speed. To address these issues, we propose an adaptive balanced network (ABNet) in this article. First, we design an enhanced effective channel attention (EECA) mechanism to improve the feature representation ability of the backbone, which can alleviate the obstacles of complex background on foreground objects. Then, to combine multiscale features adaptively in different channels and spatial positions, an adaptive feature pyramid network (AFPN) is designed to capture more discriminative features. Furthermore, considering that the original FPN ignores rich deep-level features, a context enhancement module (CEM) is proposed to exploit abundant semantic information for multiscale object detection. Experimental results on three public datasets demonstrate that our approach exhibits superior performance over baseline by only introducing less than 1.5M extra parameters.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据