4.7 Article

Multi-scale feature fusion network for pixel-level pavement distress detection

Journal

AUTOMATION IN CONSTRUCTION
Volume 141, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.autcon.2022.104436

Keywords

Deep learning; Encoder-decoder architecture; Pavement distress; Feature fusion; Semantic segmentation; Unmanned aerial vehicle (UAV)

Funding

  1. National Key Research and Development Project of China [2020YFB1600102]
  2. Natural Science Foundation of Jiangsu Province [BK20180149]

Ask authors/readers for more resources

A novel deep neural network architecture, W-segnet, based on multi-scale feature fusions, is proposed for pixel-wise distress segmentation in pavement conditions. Experimental results demonstrate the robustness of W-segnet in various scenarios, outperforming other state-of-the-art semantic segmentation models.
Automatic pavement distress detection is essential to monitoring and maintaining pavement condition. Currently, many deep learning-based methods have been utilized in pavement distress detection. However, distress segmentation remains as a challenge under complex pavement conditions. In this paper, a novel deep neural network architecture, W-segnet, based on multi-scale feature fusions, is proposed for pixel-wise distress segmentation. The proposed W-segnet concatenates distress location information with distress classification features in two symmetric encoder-decoder structures. Three major types of distresses: crack, pothole, and patch are segmented and the results were discussed. Experimental results show that the proposed W-segnet is robust in various scenarios, achieving a mean pixel accuracy (MPA) of 87.52% and a mean intersection over union (MIoU) of 75.88%. The results demonstrate that W-segnet outperforms other state-of-the-art semantic segmentation models of U-net, SegNet, and PSPNet. Comparison of cost of model training and inference indicates that W-segnet has the largest number of parameters, which needs a slightly longer training time while it does not increase the inference cost. Four public datasets were used to test the generalization ability of the proposed model and the results demonstrate that the W-segnet possesses well segmentation performance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available