☆ 4.7 Article

RTLSeg: A novel multi-component inspection network for railway track line based on instance segmentation

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

期刊

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

卷 119, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.engappai.2023.105822

关键词

Multi-component inspection; Instance segmentation; YOLACT; Feature pyramid network; Attention mechanism; CoordConv

类别

Automation & Control Systems Computer Science, Artificial Intelligence Engineering, Multidisciplinary Engineering, Electrical & Electronic

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper investigates an innovative and intelligent method for multi-component identification and common defect detection of railway track line based on instance segmentation. A railway track line image dataset is constructed and annotated manually, and a railway track line image segmentation model (RTLSeg) is proposed. Experimental results show that the proposed method is effective and outperforms the compared baseline models.

The condition monitoring of railway track line is one of the fundamental tasks to ensure the safety of the railway transportation system. Railway track line is mainly made up of tracks, fasteners, bolts, backing plates, and so on. Given the requirements for rapid and accurate inspection, an innovative and intelligent method for multi-component identification and common defect detection of railway track line is investigated based on instance segmentation. More specifically, a railway track line image (RTL-I) dataset is constructed and annotated manually in this paper. After that, based on the work of YOLACT and YOLACT++, combined with prior knowledge, a railway track line image segmentation model (RTLSeg for short) is proposed. Firstly, taking the characteristics of the objects in the RTL-I dataset, preset anchors are redesigned and a feature enhanced module is introduced in the prediction head to improve the detection and segmentation accuracy of the model. Secondly, to strengthen the internal information propagation within the model, PaFPN (path aggregation feature pyramid network) is applied instead of FPN in RTLSeg. Thirdly, with the help of CoordConv, Coord-Protonet is presented to add position awareness explicitly to the model for more robust and higher quality prototype masks. Finally, to further improve the model performance, the attention mechanism is explored and a novel spatial attention-guided bounding box branch is employed in the enhanced prediction head. Both quantitative and qualitative experimental results show that the proposed method is feasible in detecting and segmenting multi-component and common defects of railway track line, and outperforms the compared baseline models. In particular, RTLSeg is able to achieve 91.35 bbox mAP and 91.60 mask mAP with the customized dataset. Meanwhile, the average inference speed reaches 13.07 fps. The average detection accuracy and recall are 100% and 99.83%, respectively. Furthermore, the effectiveness of each optimized part of the proposed RTLSeg model is demonstrated by additional ablation study.

RTLSeg: A novel multi-component inspection network for railway track line based on instance segmentation

期刊

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

RTLSeg: A novel multi-component inspection network for railway track line based on instance segmentation

期刊

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文