☆ 4.7 Article

Multi-scale object detection in remote sensing imagery with convolutional neural networks

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2018)

期刊

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING

卷 145, 期 -, 页码 3-22

出版社

ELSEVIER

DOI: 10.1016/j.isprsjprs.2018.04.003

关键词

Object detection; Deep learning; Convolutional neural networks; Multi-modal remote sensing images

类别

Geography, Physical Geosciences, Multidisciplinary Remote Sensing Imaging Science & Photographic Technology

资金

National Natural Science Foundation of China [61303186]
Fund of Innovation of National University of Defense Technology Graduate School [B150406]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Automatic detection of multi-class objects in remote sensing images is a fundamental but challenging problem faced for remote sensing image analysis. Traditional methods are based on hand-crafted or shallow-learning-based features with limited representation power. Recently, deep learning algorithms, especially Faster region based convolutional neural networks (FRCN), has shown their much stronger detection power in computer vision field. However, several challenges limit the applications of FRCN in multi-class objects detection from remote sensing images: (1) Objects often appear at very different scales in remote sensing images, and FRCN with a fixed receptive field cannot match the scale variability of different objects; (2) Objects in large-scale remote sensing images are relatively small in size and densely peaked, and FRCN has poor localization performance with small objects; (3) Manual annotation is generally expensive and the available manual annotation of objects for training FRCN are not sufficient in number. To address these problems, this paper proposes a unified and effective method for simultaneously detecting multi-class objects in remote sensing images with large scales variability. Firstly, we redesign the feature extractor by adopting Concatenated ReLU and Inception module, which can increases the variety of receptive field size. Then, the detection is preformed by two sub-networks: a multi-scale object proposal network (MS-OPN) for object-like region generation from several intermediate layers, whose receptive fields match different object scales, and an accurate object detection network (AODN) for object detection based on fused feature maps, which combines several feature maps that enables small and densely packed objects to produce stronger response. For large-scale remote sensing images with limited manual annotations, we use cropped image blocks for training and augment them with re-scalings and rotations. The quantitative comparison results on the challenging NWPU VHR-10 data set, aircraft data set, Aerial-Vehicle data set and SAR-Ship data set show that our method is more accurate than existing algorithms and is effective for multi-modal remote sensing images. (C) 2018 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS). Published by Elsevier B.V. All rights reserved.

Multi-scale object detection in remote sensing imagery with convolutional neural networks

期刊

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multi-scale object detection in remote sensing imagery with convolutional neural networks

期刊

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文