4.7 Article

A Self-Training Hierarchical Prototype-based Ensemble Framework for Remote Sensing Scene Classification

Journal

INFORMATION FUSION
Volume 80, Issue -, Pages 179-204

Publisher

ELSEVIER
DOI: 10.1016/j.inffus.2021.11.014

Keywords

self-training; pseudo-labelling; prototypes; remote sensing; scene classification

Ask authors/readers for more resources

A novel semi-supervised ensemble framework was proposed for remote sensing scene classification, utilizing a self-training hierarchical prototype-based classifier to address the challenges of labelled data scarcity and scene complexity. Experimental results demonstrated significant improvements in classification accuracy on popular benchmark datasets with limited labelled images available.
Remote sensing scene classification plays a critical role in a wide range of real-world applications. Technically, however, scene classification is an extremely challenging task due to the huge complexity in remotely sensed scenes, and the difficulty in acquiring labelled data for model training such as supervised deep learning. To tackle these issues, a novel semi-supervised ensemble framework is proposed here using the self-training hierarchical prototype-based classifier as the base learner for chunk-by-chunk prediction. The framework has the ability to build a powerful ensemble model from both labelled and unlabelled images with minimum supervision. Different feature descriptors are employed in the proposed ensemble framework to offer multiple independent views of images. Thus, the diversity of base learners is guaranteed for ensemble classification. To further increase the overall accuracy, a novel cross-checking strategy was introduced to enable the base learners to exchange pseudolabelling information during the self-training process, and maximize the correctness of pseudo-labels assigned to unlabelled images. Extensive numerical experiments on popular benchmark remote sensing scenes demonstrated the effectiveness of the proposed ensemble framework, especially where the number of labelled images available is limited. For example, the classification accuracy achieved on the OPTIMAL-31, PatternNet and RSI-CB256 datasets was up to 99.91%, 98. 67% and 99.07% with only 40% of the image sets used as labelled training images, surpassing or at least on par with mainstream benchmark approaches trained with double the number of labelled images.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available