4.7 Review

Semantic segmentation of cracks: Data challenges and architecture

Journal

AUTOMATION IN CONSTRUCTION
Volume 135, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.autcon.2021.104110

Keywords

Semantic segmentation; Crack detection; Imbalanced data; Deep learning; Infrastructure monitoring

Funding

  1. Engineering and Physical Research Council [1835904]

Ask authors/readers for more resources

This paper analyzes the research on semantic segmentation in the presence of fine structures and the effectiveness of different approaches to address class imbalance. UNet architecture is found to outperform networks without pooling layers in both performance and training time. The authors argue that UNet architecture is the state of the art for crack segmentation. However, networks without pooling layers may become attractive when computational cost is not a concern.
Deep Learning (DL) semantic image segmentation is a technique used in several fields of research. The present paper analyses semantic crack segmentation as a case study to review the up to date research on semantic segmentation in the presence of fine structures and the effectiveness of established approaches to address the inherent class imbalance issue. The established UNet architecture is tested against networks consisting exclusively of stacked convolution without pooling layers (straight networks), with regard to the resolution of their segmentation results. Dice and Focal losses are also compared against each other to evaluate their effectiveness on highly imbalanced data. With the same aim, dropout and data augmentation approaches are tested, as additional regularizing mechanisms, to address the uneven distribution of the dataset. The experiments show that the good selection of the loss function has more impact in handling the class imbalance and boosting the detection performance than all the other regularizers with regards to segmentation resolution. Moreover, UNet, the architecture considered as reference, clearly outperforms the networks with no pooling layers both in performance and training time. The authors argue that UNet architectures, compared to the networks with no pooling layers, achieve high detection performance at a very low cost in terms of training time. Therefore, the authors consider such architecture as the state of the art for semantic segmentation of cracks. On the other hand, once computational cost is not an issue anymore thanks to constant improvements of technology, the application of networks without pooling layers might become attractive again because of their simplicity of and high performance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available