4.7 Article

Align Deep Features for Oriented Object Detection

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2021.3062048

关键词

Feature extraction; Detectors; Object detection; Location awareness; Convolution; Task analysis; Computer architecture; Aerial images; deep learning; feature alignment; object detection

资金

  1. National Natural Science Foundation of China (NSFC) [61922065, 61771350, 41820104006, 61871299]
  2. Science and Technology Major Project of Hubei Province (Next-Generation AI Technologies) [2019AEA170]
  3. Shanghai Aerospace Science and Technology Innovation Project [SAST2019-094]

向作者/读者索取更多资源

Significant progress has been made in the past decade on detecting objects in aerial images. We propose a single-shot alignment network (S(2)A-Net) that consists of two modules to address the misalignment issue between anchors and convolutional features, improving the consistency between classification score and localization accuracy.
The past decade has witnessed significant progress on detecting objects in aerial images that are often distributed with large-scale variations and arbitrary orientations. However, most of existing methods rely on heuristically defined anchors with different scales, angles, and aspect ratios, and usually suffer from severe misalignment between anchor boxes (ABs) and axis-aligned convolutional features, which lead to the common inconsistency between the classification score and localization accuracy. To address this issue, we propose a single-shot alignment network (S(2)A-Net) consisting of two modules: a feature alignment module (FAM) and an oriented detection module (ODM). The FAM can generate high-quality anchors with an anchor refinement network and adaptively align the convolutional features according to the ABs with a novel alignment convolution. The ODM first adopts active rotating filters to encode the orientation information and then produces orientation-sensitive and orientation-invariant features to alleviate the inconsistency between classification score and localization accuracy. Besides, we further explore the approach to detect objects in large-size images, which leads to a better trade-off between speed and accuracy. Extensive experiments demonstrate that our method can achieve the state-of-the-art performance on two commonly used aerial objectsx2019; data sets (i.e., DOTA and HRSC2016) while keeping high efficiency.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据