4.7 Article

Multiscale Information Fusion for Hyperspectral Image Classification Based on Hybrid 2D-3D CNN

Journal

REMOTE SENSING
Volume 13, Issue 12, Pages -

Publisher

MDPI
DOI: 10.3390/rs13122268

Keywords

convolutional neural network (CNN); hyperspectral image classification; multiscale information

Funding

  1. National Natural Science Foundation of China (NSFC) [61775176]
  2. National Major Special Projects of China [GFZX04014308]
  3. Shaanxi Province Key Research and Development Program of China [2020GY-131, 2021SF-135]
  4. Fundamental Research Funds for the Central Universities [xjh012020021]
  5. Natural Science Foundation of Shanghai [18ZR1437200]

Ask authors/readers for more resources

Hyperspectral imaging is widely used in classification due to its rich spectral information. This study introduces a lightweight multiscale squeeze-and-excitation pyramid pooling network (MSPN) to address the small sample problem in hyperspectral classification. The MSPN framework can learn and fuse deeper hierarchical spatial-spectral features with fewer training samples, achieving high classification accuracy on various datasets.
Hyperspectral images are widely used for classification due to its rich spectral information along with spatial information. To process the high dimensionality and high nonlinearity of hyperspectral images, deep learning methods based on convolutional neural network (CNN) are widely used in hyperspectral classification applications. However, most CNN structures are stacked vertically in addition to using a onefold size of convolutional kernels or pooling layers, which cannot fully mine the multiscale information on the hyperspectral images. When such networks meet the practical challenge of a limited labeled hyperspectral image dataset-i.e., small sample problem-the classification accuracy and generalization ability would be limited. In this paper, to tackle the small sample problem, we apply the semantic segmentation function to the pixel-level hyperspectral classification due to their comparability. A lightweight, multiscale squeeze-and-excitation pyramid pooling network (MSPN) is proposed. It consists of a multiscale 3D CNN module, a squeezing and excitation module, and a pyramid pooling module with 2D CNN. Such a hybrid 2D-3D-CNN MSPN framework can learn and fuse deeper hierarchical spatial-spectral features with fewer training samples. The proposed MSPN was tested on three publicly available hyperspectral classification datasets: Indian Pine, Salinas, and Pavia University. Using 5%, 0.5%, and 0.5% training samples of the three datasets, the classification accuracies of the MSPN were 96.09%, 97%, and 96.56%, respectively. In addition, we also selected the latest dataset with higher spatial resolution, named WHU-Hi-LongKou, as the challenge object. Using only 0.1% of the training samples, we could achieve a 97.31% classification accuracy, which is far superior to the state-of-the-art hyperspectral classification methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available