☆ 4.7 Article

Multiscale Attention Networks for Pavement Defect Detection

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2023)

期刊

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT

卷 72, 期 -, 页码 -

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIM.2023.3298391

关键词

Attention module; deep neural network; image identification; multiscale convolution; pavement defect detection

类别

Engineering, Electrical & Electronic Instruments & Instrumentation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The researchers propose a multiscale mobile attention-based network called MANet to automatically detect pavement defects. The approach utilizes an encoder-decoder architecture with MobileNet as the backbone network and incorporates multiscale convolution kernels and hybrid attention mechanisms. The proposed approach achieves state-of-the-art performance on benchmark datasets and provides satisfactory results in practical scenarios. Rating: 9/10

Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role in reducing the harm of various pavement defects. Particularly, the recent development in deep learning (DL)-based convolution neural networks (CNNs) has shown competitive performance in image detection and classification. To detect pavement defects automatically and improve effects, a multiscale mobile attention-based network, which we termed MANet, is proposed to perform the detection of pavement defects. The architecture of the encoder-decoder is used in MANet, where the encoder adopts the MobileNet as the backbone network to extract pavement defect features. Instead of the original 3 x 3 convolution, the multiscale convolution kernels are used in depthwise separable convolution (DSConv) layers of the network. Furthermore, the hybrid attention mechanism is separately incorporated into the encoder and decoder modules to infer the significance of spatial points and interchannel relationship features for the input intermediate feature maps. The proposed approach achieves state-of-the-art performance on two publicly available benchmark datasets, i.e., the Crack500 (500 crack images with 2000 x 1500 pixels) and CFD (118 crack images with 480 x 320 pixels) datasets. The mean intersection over union (MIoU) of the proposed approach on these two datasets reaches 0.7219 and 0.7788, respectively. Ablation experiments show that the multiscale convolution and hybrid attention modules can effectively help the model extract high-level feature representations and generate more accurate pavement crack segmentation results. We further test the model on locally collected pavement crack images (131 images with 1024 x 768 pixels) and it achieves a satisfactory result. The proposed approach realizes the MIoU of 0.6514 on the local dataset and outperforms other compared baseline methods. Experimental findings demonstrate the validity and feasibility of the proposed approach and it provides a viable solution for pavement crack detection in practical application scenarios. Our code is available at https://github.com/xtu502/pavement-defects.

Multiscale Attention Networks for Pavement Defect Detection

期刊

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multiscale Attention Networks for Pavement Defect Detection

期刊

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文