☆ 4.7 Article

Multiscale Information Fusion for Hyperspectral Image Classification Based on Hybrid 2D-3D CNN

REMOTE SENSING (2021)

Journal

REMOTE SENSING

Volume 13, Issue 12, Pages -

Publisher

MDPI

DOI: 10.3390/rs13122268

Keywords

convolutional neural network (CNN); hyperspectral image classification; multiscale information

Funding

National Natural Science Foundation of China (NSFC) [61775176]
National Major Special Projects of China [GFZX04014308]
Shaanxi Province Key Research and Development Program of China [2020GY-131, 2021SF-135]
Fundamental Research Funds for the Central Universities [xjh012020021]
Natural Science Foundation of Shanghai [18ZR1437200]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Hyperspectral imaging is widely used in classification due to its rich spectral information. This study introduces a lightweight multiscale squeeze-and-excitation pyramid pooling network (MSPN) to address the small sample problem in hyperspectral classification. The MSPN framework can learn and fuse deeper hierarchical spatial-spectral features with fewer training samples, achieving high classification accuracy on various datasets.

Hyperspectral images are widely used for classification due to its rich spectral information along with spatial information. To process the high dimensionality and high nonlinearity of hyperspectral images, deep learning methods based on convolutional neural network (CNN) are widely used in hyperspectral classification applications. However, most CNN structures are stacked vertically in addition to using a onefold size of convolutional kernels or pooling layers, which cannot fully mine the multiscale information on the hyperspectral images. When such networks meet the practical challenge of a limited labeled hyperspectral image dataset-i.e., small sample problem-the classification accuracy and generalization ability would be limited. In this paper, to tackle the small sample problem, we apply the semantic segmentation function to the pixel-level hyperspectral classification due to their comparability. A lightweight, multiscale squeeze-and-excitation pyramid pooling network (MSPN) is proposed. It consists of a multiscale 3D CNN module, a squeezing and excitation module, and a pyramid pooling module with 2D CNN. Such a hybrid 2D-3D-CNN MSPN framework can learn and fuse deeper hierarchical spatial-spectral features with fewer training samples. The proposed MSPN was tested on three publicly available hyperspectral classification datasets: Indian Pine, Salinas, and Pavia University. Using 5%, 0.5%, and 0.5% training samples of the three datasets, the classification accuracies of the MSPN were 96.09%, 97%, and 96.56%, respectively. In addition, we also selected the latest dataset with higher spatial resolution, named WHU-Hi-LongKou, as the challenge object. Using only 0.1% of the training samples, we could achieve a 97.31% classification accuracy, which is far superior to the state-of-the-art hyperspectral classification methods.

Multiscale Information Fusion for Hyperspectral Image Classification Based on Hybrid 2D-3D CNN

Journal

REMOTE SENSING

Publisher

MDPI

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Multiscale Information Fusion for Hyperspectral Image Classification Based on Hybrid 2D-3D CNN

Journal

REMOTE SENSING

Publisher

MDPI

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper