4.7 Article

High Quality Object Detection for Multiresolution Remote Sensing Imagery Using Cascaded Multi-Stage Detectors

Journal

REMOTE SENSING
Volume 14, Issue 9, Pages -

Publisher

MDPI
DOI: 10.3390/rs14092091

Keywords

object detection; cascaded detectors; Intersection over Union (IoU) threshold; classification ensemble; bounding box regression; multiresolution remote sensing images

Funding

  1. Strategic Priority Research Program of the Chinese Academy of Sciences [XDA19030301]
  2. National Natural Science Foundation of China [41801360, 41601212, 41771403, 42001286]
  3. Fundamental Research Foundation of Shenzhen Science and Technology Program [KCXFZ202002011006298]
  4. Guangdong Basic and Applied Basic Research Foundation [2019A1515011501]

Ask authors/readers for more resources

Deep-learning-based object detectors have made significant improvements in object detection in remote sensing images. However, challenges still exist due to the variation in object scales and the imbalance between positive and negative samples. This paper proposes a Cascade R-CNN++ structure to address these challenges and demonstrates its effectiveness through experiments.
Deep-learning-based object detectors have substantially improved state-of-the-art object detection in remote sensing images in terms of precision and degree of automation. Nevertheless, the large variation of the object scales makes it difficult to achieve high-quality detection across multiresolution remote sensing images, where the quality is defined by the Intersection over Union (IoU) threshold used in training. In addition, the imbalance between the positive and negative samples across multiresolution images worsens the detection precision. Recently, it was found that a Cascade region-based convolutional neural network (R-CNN) can potentially achieve a higher quality of detection by introducing a cascaded three-stage structure using progressively improved IoU thresholds. However, the performance of Cascade R-CNN degraded when the fourth stage was added. We investigated the cause and found that the mismatch between the ROI features and the classifier could be responsible for the degradation of performance. Herein, we propose a Cascade R-CNN++ structure to address this issue and extend the three-stage architecture to multiple stages for general use. Specifically, for cascaded classification, we propose a new ensemble strategy for the classifier and region of interest (RoI) features to improve classification accuracy at inference. In localization, we modified the loss function of the bounding box regressor to obtain higher sensitivity around zero. Experiments on the DOTA dataset demonstrated that Cascade R-CNN++ outperforms Cascade R-CNN in terms of precision and detection quality. We conducted further analysis on multiresolution remote sensing images to verify model transferability across different object scales.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available