4.5 Article

Spatial hierarchy perception and hard samples metric learning for high-resolution remote sensing image object detection

期刊

APPLIED INTELLIGENCE
卷 52, 期 3, 页码 3193-3208

出版社

SPRINGER
DOI: 10.1007/s10489-021-02335-0

关键词

Remote sensing; Object detection; Spatial hierarchy perception; Metric learning

资金

  1. State's Key Project of Research and Development Plan of China [2016YFC0600908]
  2. National Natural Science Foundation of China [61806206, 61772530]
  3. Natural Science Foundation of Jiangsu Province [BK20180639, BK20201346, BK20171192]
  4. Six Talent Peaks Project in Jiangsu Province [2015-DZXX-010, 2018-XYDXX-044]

向作者/读者索取更多资源

A novel remote sensing image object detection method was proposed in this paper, which utilizes a spatial hierarchy perception component and hard samples metric learning to extract features under different spatial hierarchies, reduce feature differences of hard samples in the same category, and strengthen object feature learning by decoupling complex backgrounds. Experimental results demonstrate that the proposed method outperforms several state-of-the-art object detection methods in terms of detection performance.
Due to the different shooting angles, altitudes and scenes, remote sensing images contain many complex backgrounds and multi-scale objects. Moreover, objects in remote sensing images are much smaller relative to the backgrounds, easily occluded by buildings and trees. These cause difficult feature extraction and increase the intra-class diversity of objects, making object detection on remote sensing images more challenging. In this paper, we propose a novel remote sensing image object detection method (SHDet) based on spatial hierarchy perception component (SHPC) and hard samples metric learning (HSML). We design a SHPC to extract the feature under the different spatial hierarchies and learn the contribution weights between feature channels to enhance the feature representation. HSML is proposed to narrow the feature differences of hard samples in the same category, reducing the error detection caused by intra-class diversity. Besides, we decouple the complex background to build the pre-training datasets for pre-training the object detection model, strengthening the object feature learning. The experiments carried out on two widely used remote sensing datasets (NWPU VHR-10 and DOTA-v1.5) show that the proposed method has better detection performance compared with several state-of-the-art object detection methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据