☆ 4.7 Article

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

期刊

COMPUTERS IN BIOLOGY AND MEDICINE

卷 130, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.compbiomed.2021.104216

关键词

Text mining negation detection context; disambiguation clinical information extraction

类别

Biology Computer Science, Interdisciplinary Applications Engineering, Biomedical Mathematical & Computational Biology

资金

NIHR Birmingham ECMC
NIHR Birmingham SRMRC
Nanocommons H2020-EU [731032]
NIHR Birmingham Biomedical Research Centre
MRC HDR UK - UK Research and Innovation, Department of Health and Social Care (England) [HDRUK/CFC/01]
King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) [URF/1/3790-01-01]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The study presents a heuristic algorithm for negation detection based on dependency graphs, showing strong performance in clinical text mining without the need for complex rule development and adaptation. Comparing with other rule-based algorithms, the algorithm may overlook advanced cases in certain situations, but it remains a fast, powerful, and stable alternative method.

Negation detection is an important task in biomedical text mining. Particularly in clinical settings, it is of critical importance to determine whether findings mentioned in text are present or absent. Rule-based negation detection algorithms are a common approach to the task, and more recent investigations have resulted in the development of rule-based systems utilising the rich grammatical information afforded by typed dependency graphs. However, interacting with these complex representations inevitably necessitates complex rules, which are time-consuming to develop and do not generalise well. We hypothesise that a heuristic approach to determining negation via dependency graphs could offer a powerful alternative. We describe and implement an algorithm for negation detection based on grammatical distance from a negatory construct in a typed dependency graph. To evaluate the algorithm, we develop two testing corpora comprised of sentences of clinical text extracted from the MIMIC-III database and documents related to hypertrophic cardiomyopathy patients routinely collected at University Hospitals Birmingham NHS trust. Gold-standard validation datasets were built by a combination of human annotation and examination of algorithm error. Finally, we compare the performance of our approach with four other rule-based algorithms on both gold-standard corpora. The presented algorithm exhibits the best performance by f-measure over the MIMIC-III dataset, and a similar performance to the syntactic negation detection systems over the HCM dataset. It is also the fastest of the dependency-based negation systems explored in this study. Our results show that while a single heuristic approach to dependency-based negation detection is ignorant to certain advanced cases, it nevertheless forms a powerful and stable method, requiring minimal training and adaptation between datasets. As such, it could present a drop-in replacement or augmentation for many-rule negation approaches in clinical text-mining pipelines, particularly for cases where adaptation and rule development is not required or possible.

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text

期刊

COMPUTERS IN BIOLOGY AND MEDICINE

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text

期刊

COMPUTERS IN BIOLOGY AND MEDICINE

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文