4.7 Article

Simultaneous pixel-level concrete defect detection and grouping using a fully convolutional model

期刊

出版社

SAGE PUBLICATIONS LTD
DOI: 10.1177/1475921720985437

关键词

Concrete surface defects; pixel-level detection; defect pixel grouping; fully convolutional model; YOLOv3; mask subnet

向作者/读者索取更多资源

The study introduces a novel fully convolutional model for detecting and grouping image pixels for individual defects, demonstrating good robustness and accuracy. Compared with existing instance segmentation and semantic segmentation networks, the proposed model has a distinct advantage in boundary delineation and accuracy.
Deep learning techniques have attracted significant attention in the field of visual inspection of civil infrastructure systems recently. Currently, most deep learning-based visual inspection techniques utilize a convolutional neural network to recognize surface defects either by detecting a bounding box of each defect or classifying all pixels on an image without distinguishing between different defect instances. These outputs cannot be directly used for acquiring the geometric properties of each individual defect in an image, thus hindering the development of fully automated structural assessment techniques. In this study, a novel fully convolutional model is proposed for simultaneously detecting and grouping the image pixels for each individual defect on an image. The proposed model integrates an optimized mask subnet with a box-level detection network, where the former outputs a set of position-sensitive score maps for pixel-level defect detection and the latter predicts a bounding box for each defect to group the detected pixels. An image dataset containing three common types of concrete defects, crack, spalling and exposed rebar, is used for training and testing of the model. Results demonstrate that the proposed model is robust to various defect sizes and shapes and can achieve a mask-level mean average precision (mAP) of 82.4% and a mean intersection over union (mIoU) of 75.5%, with a processing speed of about 10 FPS at input image size of 576 x 576 when tested on an NVIDIA GeForce GTX 1060 GPU. Its performance is compared with the state-of-the-art instance segmentation network Mask R-CNN and the semantic segmentation network U-Net. The comparative studies show that the proposed model has a distinct defect boundary delineation capability and outperforms the Mask R-CNN and the U-Net in both accuracy and speed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据