☆ 4.5 Article

Global and pyramid convolutional neural network with hybrid attention mechanism for hyperspectral image classification

GEOCARTO INTERNATIONAL (2023)

Journal

GEOCARTO INTERNATIONAL

Volume 38, Issue 1, Pages -

Publisher

TAYLOR & FRANCIS LTD

DOI: 10.1080/10106049.2023.2226112

Keywords

Hyperspectral image classification; global convolution; multiscale convolution; neural network; attention mechanism

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

In this study, a global and pyramid convolutional network with a hybrid attention mechanism (GPHANet) is proposed for hyperspectral image classification. The model leverages a two-branch architecture to extract both local and global features and applies a compact hybrid attention mechanism (HAM) to capture long-range dependencies. Experimental results on three datasets demonstrate that GPHANet outperforms state-of-the-art methods.

Convolutional neural networks (CNNs) have shown impressive results in the hyperspectral image (HSI) classification. However, they still face certain limitations that can impact their effectiveness. The kernels in standard convolution have fixed spatial sizes and spectral depths, which cannot capture global semantic features from HSI and the existing methods for extracting multiscale information are inadequate. Based on these, we propose a global and pyramid convolutional network with a hybrid attention mechanism (GPHANet) for HSI classification. GPHANet adopts a two-branch architecture to extract both local and global hyperspectral features, leveraging the strengths of each to enhance classification performance. The local branch employs dynamic pyramid convolution, which customizes parameters for convolutional kernels of multiple scales based on the input image. This design enables the model to accurately and comprehensively extract multiscale feature information, capturing subtle variations in the HSI. The global branch utilizes circular convolution with a global receptive field, enabling each position in the convolutional layers to gather information from the entire input space. This allows the model to learn global contextual features, enhancing its understanding of the overall structure and semantics of the HSI. After extracting deep global and local features, we apply a compact hybrid attention mechanism (HAM) to capture long-range dependencies along both spatial and spectral dimensions, resulting in a more discriminative feature representation. On the Pavia University, Salinas, and Hong Hu datasets, the proposed model achieved overall accuracies of 99.22%, 98.74%, and 96.80%, respectively, using limited training samples. These results are much better compared to the state-of-the-art methods.

Global and pyramid convolutional neural network with hybrid attention mechanism for hyperspectral image classification

Journal

GEOCARTO INTERNATIONAL

Publisher

TAYLOR & FRANCIS LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Global and pyramid convolutional neural network with hybrid attention mechanism for hyperspectral image classification

Journal

GEOCARTO INTERNATIONAL

Publisher

TAYLOR & FRANCIS LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper