☆ 4.2 Article

Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports

RADIOLOGY-ARTIFICIAL INTELLIGENCE (2022)

期刊

RADIOLOGY-ARTIFICIAL INTELLIGENCE

卷 4, 期 4, 页码 -

出版社

RADIOLOGICAL SOC NORTH AMERICA (RSNA)

DOI: 10.1148/ryai.210185

关键词

Computer Applications; Technology Assessment

类别

Computer Science, Artificial Intelligence Radiology, Nuclear Medicine & Medical Imaging

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study developed radiology-specific BERT models that can identify SR errors in radiology reports and suggest corrections.

Purpose: To develop radiology domain-specific bidirectional encoder representations from transformers (BERT) models that can identify speech recognition (SR) errors and suggest corrections in radiology reports. Materials and Methods: A pretrained BERT model, Clinical BioBERT, was further pretrained on a corpus of 114 008 radiology reports between April 2016 and August 2019 that were retrospectively collected from two hospitals. Next, the model was fine-tuned on a training dataset of generated insertion, deletion, and substitution errors, creating Radiology BERT. This model was retrospectively evaluated on an independent dataset of radiology reports with generated errors (n = 18 885) and on unaltered report sentences (n = 2000) and prospectively evaluated on true clinical SR errors (n = 92). Correction Radiology BERT was separately trained to suggest corrections for detected deletion and substitution errors. Area under the receiver operating characteristic curve (AUC) and bootstrapped 95% CIs were calculated for each evaluation dataset. Results: Radiology-specific BERT had AUC values of >.99 (95% CI: .0.99, .0.99), 0.94 (95% CI: 0.93, 0.94), 0.98 (95% CI: 0.98, 0.98), and 0.97 (95% CI: 0.97, 0.97) for detecting insertion, deletion, substitution, and all errors, respectively, on the independently generated test set. Testing on unaltered report impressions revealed a sensitivity of 82% (28 of 34; 95% CI: 70%, 93%) and specificity of 88% (1521 of 1728; 95% CI: 87%, 90%). Testing on prospective SR errors showed an accuracy of 75% (69 of 92; 95% CI: 65%, 83%). Finally, the correct word was the top suggestion for 45.6% (475 of 1041; 95% CI: 42.5%, 49.3%) of errors. Conclusion: Radiology-specific BERT models fine-tuned on generated errors were able to identify SR errors in radiology reports and suggest corrections. (C) RSNA, 2022

Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports

期刊

RADIOLOGY-ARTIFICIAL INTELLIGENCE

出版社

RADIOLOGICAL SOC NORTH AMERICA (RSNA)

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports

期刊

RADIOLOGY-ARTIFICIAL INTELLIGENCE

出版社

RADIOLOGICAL SOC NORTH AMERICA (RSNA)

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文