4.7 Article

RiDOP: A Rotation-Invariant Detector with Simple Oriented Proposals in Remote Sensing Images

期刊

REMOTE SENSING
卷 15, 期 3, 页码 -

出版社

MDPI
DOI: 10.3390/rs15030594

关键词

oriented object detection; object representation; rotation-invariant feature; image processing; remote sensing

向作者/读者索取更多资源

Compared with general object detection in natural images, oriented object detection in remote sensing images is challenging due to arbitrary orientations. Existing CNN-based methods often adopt complex and inefficient approaches to model variant orientations. In this paper, we propose a lightweight approach to generate oriented proposals and extract rotation-invariant features. Our method achieves state-of-the-art accuracy while reducing the model size by 40%.
Compared with general object detection with horizontal bounding boxes in natural images, oriented object detection in remote sensing images is an active and challenging research topic as objects are usually displayed in arbitrary orientations. To model the variant orientations of oriented objects, general CNN-based methods usually adopt more parameters or well-designed modules, which are often complex and inefficient. To address this issue, the detector requires two key components to deal with: (i) generating oriented proposals in a light-weight network to achieve effective representation of arbitrarily oriented objects; (ii) extracting the rotation-invariant feature map in both spatial and orientation dimensions. In this paper, we propose a novel, lightweight rotated region proposal network to produce arbitrary-oriented proposals by sliding two vertexes only on adjacent sides and adopt a simple yet effective representation to describe oriented objects. This may decrease the complexity of modeling orientation information. Meanwhile, we adopt the rotation-equivariant backbone to generate the feature map with explicit orientation channel information and utilize the spatial and orientation modules to obtain completely rotation-invariant features in both dimensions. Without tricks, extensive experiments performed on three challenging datasets DOTA-v1.0, DOTA-v1.5 and HRSC2016 demonstrate that our proposed method can reach state-of-the-art accuracy while reducing the model size by 40% in comparison with the previous best method.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据