4.7 Article

A Novel Nonlocal-Aware Pyramid and Multiscale Multitask Refinement Detector for Object Detection in Remote Sensing Images

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2021.3059450

关键词

Feature extraction; Detectors; Task analysis; Head; Visualization; Remote sensing; Neural networks; Attention; multiscale; multitask; object detection (OD); remote sensing (RS) images

资金

  1. National Natural Science Foundation of China [61922013, U1833203, 61772225]
  2. Beijing Natural Science Foundation [JQ20021, L191004]
  3. National Natural Science Foundation of Guangdong [2018B030311046]
  4. Guangdong University Key Platforms and Research Projects [2018KZDXM066, 2017KZDXM081, 2015KQNCX153]

向作者/读者索取更多资源

Object detection is an important task in computer vision and has wide applications in remote sensing. However, the complexity, large-scale variation, and dense instances in remote sensing pose significant challenges for object detection. To address these challenges, a novel Nonlocal-aware Pyramid and Multiscale Multitask Refinement Detector is proposed, which utilizes nonlocal-aware pyramid attention and multiscale refinement feature pyramid network to improve detection performance.
Object detection (OD) is an important task of computer vision and has been widely used in many fields, including remote sensing (RS). However, the complex scenes, large-scale variation, and dense instances of RS bring huge challenges to OD. To meet these challenges, a novel Nonlocal-aware Pyramid and Multiscale Multitask Refinement Detector (NPMMR-Det) is proposed. Specifically, nonlocal-aware pyramid attention (NP-Attention) is designed for guiding a neural network model to focus more on efficient features and suppress background noise. Then a multiscale refinement feature pyramid network (MSR-FPN) is proposed to fuse the multiscale context features extracted by the NP-Attention guided neural network and adjust the optimal receptive field. In order to use these features more effectively, a multitask refinement head called MTR-Head, with offset sharing and a modulation mechanism, is developed to refine the feature misalignment between the localization task and the classification task. Extensive experiments performed on two public RS data sets demonstrate that the proposed NPMMR-Det achieves competitive performance compared with state-of-the-art methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据