4.7 Article

Efficient Convolutional Neural Architecture Search for Remote Sensing Image Scene Classification

Journal

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
Volume 59, Issue 7, Pages 6092-6105

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2020.3020424

Keywords

Remote sensing; Task analysis; Computer architecture; Feature extraction; Data models; Machine learning; Semantics; Convolutional neural networks (CNNs); neural architecture search (NAS); remote sensing image; scene classification

Funding

  1. Key Research and Development Plan of Innovation Chain of Industries in Shaanxi Province [2019ZDLGY09-05]
  2. National Natural Science Foundation of China [61772399, U1701267, 61773304, 61672405, 61772400]
  3. Technology Foundation for Selected Overseas Chinese Scholar in Shaanxi [2017021, 2018021]
  4. Program for Cheung Kong Scholars and Innovative Research Team in University [IRT_15R53]

Ask authors/readers for more resources

This article investigates a new paradigm to automatically design a suitable CNN architecture for scene classification, achieving impressive classification performance on remote sensing scene images through an efficient architecture search framework and dataset merging strategy.
As a fundamental but challenging task in the interpretation of remote sensing images, scene classification plays an important role in various applications and has become an active research topic. Many previous works have demonstrated the remarkable performance of the deep convolutional neural networks (CNNs) for remote sensing scene classification. However, the progress made by CNN-based methods for scene classification has gradually reached saturation in recent years, due to the serious dependence on the pretrained CNN models, the limitations of manually designed network architecture and the disadvantages of existing data sets. In this article, a new paradigm to automatically design a suitable CNN architecture for scene classification is investigated. We propose an efficient architecture search framework to discover optimal network architectures in continuous search space with the gradient-based optimization method. Our framework consists of two stages: the search phase and the evaluation phase. During the search process, a greedy and progressive search strategy is introduced to search network building blocks (i.e., cells) through bilevel optimization. Besides, we propose a simple architecture regularization scheme to further improve the search efficiency and the robustness of discovered architectures. After the search process, the optimal cell architectures are determined and then repeatedly stacked to construct the final network for evaluation. For the data set, we propose a mergence strategy to build a new large-scale remote sensing scene image data set that contains rich scene categories and image diversity, making it feasible to find a new CNN model with strong generalization ability for scene classification. Extensive experiments demonstrate the efficiency of the proposed search strategies and the impressive classification performance of searched CNN architectures on seven public benchmark data sets, including four large-scale data sets and three small-scale data sets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available