4.7 Article

Correlation analysis and text classification of chemical accident cases based on word embedding

期刊

PROCESS SAFETY AND ENVIRONMENTAL PROTECTION
卷 158, 期 -, 页码 698-710

出版社

ELSEVIER
DOI: 10.1016/j.psep.2021.12.038

关键词

Text mining; Correlation analysis; Text classification; Word embedding; Deep learning; Chemical accident cases

资金

  1. Technology Innovation Project of Hunan Province [2018GK1040]
  2. Natural Science Foundation of Shandong Province [ZR2020MB124]

向作者/读者索取更多资源

Accident precursors provide valuable clues for risk assessment and risk warning. Text-mining methods based on deep learning have been shown to be effective in extracting information from chemical accident cases. This study developed a text-mining method for chemical accident cases using word embedding and deep learning, which proved to be efficient in extracting useful insights and classifying different types and causes of accidents.
Accident precursors can provide valuable clues for risk assessment and risk warning. Trends such as the main characteristics, common causes, and high-frequency types of chemical accidents can provide references for formulating safety-management strategies. However, such information is usually documented in unstructured or semistructured free text related to chemical accident cases, and it can be costly to manually extract the information. Recently, text-mining methods based on deep learning have been shown to be very effective. This study, therefore, developed a text-mining method for chemical accident cases based on word embedding and deep learning. First, the word2vec model was used to obtain word vectors from a text corpus of chemical accident cases. Then, a bidirectional long short-term memory (LSTM) model with an attention mechanism was constructed to classify the types and causes of Chinese chemical accident cases. The case studies revealed the following results: 1) Common trends in chemical accidents (e.g., characteristics, causes, high-frequency types) could be obtained through correlation analysis based on word embedding; 2) The developed text-classification model could classify different types of accidents as fires, explosions, poisoning, and others, and the average p (73.1%) and r (72.5%) of the model achieved ideal performance for Chinese text classification; 3) The developed text-classification model could classify the causes of accidents as personal unsafe act, personal habitual behavior, unsafe conditions of equipment or materials and vulnerabilities management strategy; p and r were 63.6% for the causes of vulnerabilities management strategy, and the average p and r are both 60.7%; 4) the accident precursors of explosion, fire, and poisoning were obtained through correlation analyses of each high-frequency type of chemical accident case based on text classification; 5) the text-mining method can provide site managers with an efficient tool for extracting useful insights from chemical accident cases based on word embedding and deep learning. (c) 2021 Institution of Chemical Engineers. Published by Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据