☆ 4.7 Article

Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional neural network

COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING (2023)

Journal

COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING

Volume -, Issue -, Pages -

Publisher

WILEY

DOI: 10.1111/mice.13003

Keywords

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

In this paper, a hybrid semantic segmentation algorithm named SCDeepLab is proposed, which addresses the limitations of CNN-based algorithms in capturing global features and the weaknesses of Transformer-based networks in losing local features. SCDeepLab combines the advantages of Swin Transformer and CNN to effectively extract both global semantic and local detailed information, resulting in higher segmentation accuracy compared to previous methods.

In the field of tunnel lining crack identification, the semantic segmentation algorithms based on convolution neural network (CNN) are extensively used. Owing to the inherent locality of CNN, these algorithms cannot make full use of context semantic information, resulting in difficulty in capturing the global features of crack. Transformer-based networks can capture global semantic information, but this method also has the deficiencies of strong data dependence and easy loss of local features. In this paper, a hybrid semantic segmentation algorithm for tunnel lining crack, named SCDeepLab, is proposed by fusing Swin Transformer and CNN in the encoding and decoding framework of DeepLabv3+ to address the above issues. In SCDeepLab, a joint backbone network is introduced with CNN-based Inverse Residual Block and Swin Transformer Block. The former is used to extract the local detailed information of the crack to generate the shallow feature layer, whereas the latter is used to extract the global semantic information to obtain the deep feature layer. In addition, Efficient Channel Attention enhanced Feature Fusion Module is proposed to fuse the shallow and deep features to combine the advantages of the two types of features. Furthermore, the strategy of transfer learning is adopted to solve the data dependency of Swin Transformer. The results show that the mean intersection over union (mIoU) and mean pixel accuracy (mPA) of SCDeepLab on the data sets constructed in this paper are 77.41% and 84.42%, respectively, which have higher segmentation accuracy than previous CNN-based and transformer-based semantic segmentation algorithms.

Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional neural network

Journal

COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING

Publisher

WILEY

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional neural network

Journal

COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING

Publisher

WILEY

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper