4.5 Article

Convolutional Neural Network Algorithm-Based Novel Automatic Text Classification Framework for Construction Accident Reports

出版社

ASCE-AMER SOC CIVIL ENGINEERS
DOI: 10.1061/JCEMD4.COENG-13523

关键词

Deep learning; Natural language processing (NLP); Construction safety; Text classification; Accident injury types

向作者/读者索取更多资源

This paper proposes a text self-classification model based on deep learning natural language processing (NLP) technology for automated classification of construction site accident cases by accident type. The model utilizes pretrained Word2Vec word embeddings and a convolutional neural network (CNN) model to achieve excellent feature extraction and learning abilities. This research provides a useful method for obtaining reliable accident prevention knowledge from textual descriptions.
Construction sites remain one of the most hazardous workplaces globally. To improve workplace safety in the construction industry and reduce the personal injuries and socioeconomic impacts resulting from workplace accidents, tacit knowledge containing fundamental causes of accidents or specific contextual factors can be extracted from past accident narrative reports. However, manually analyzing unstructured or semistructured textual data stored in records is a daunting task, and requires the use of automated and intelligent technologies to achieve rapid and accurate knowledge acquisition. Therefore, this paper proposes a text self-classification model based on deep learning natural language processing (NLP) technology for automated classification of construction site accident cases by accident type. First, combined with two statistical measures, mutual information and information entropy, the preprocessed text data were subjected to phrase segmentation to identify more complete and accurate accident precursor information without human intervention. Then a complete multilayer and multisize convolutional neural network (CNN) model was constructed using pretrained Word2Vec word embeddings for text self-classification tasks. Finally, the test results of the CNN classification algorithm were compared with the practical application results of three shallow learning algorithms, and the performance of different types of classification algorithms was evaluated. The results showed that the CNN-based deep learning algorithm developed in this paper demonstrated excellent feature extraction and learning abilities in the task of automatic text classification in the field of NLP. This not only demonstrated that reliable accident prevention knowledge could be obtained from the textual descriptions of construction accidents, but also provided a novel model reference for document archiving and information retrieval.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据