4.7 Article

Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants

Journal

Publisher

ELSEVIER IRELAND LTD
DOI: 10.1016/j.cmpb.2020.105891

Keywords

Natural language processing; Adverse drug reaction extraction; Electronic health records; Deep learning

Funding

  1. NVIDIA Corporation
  2. Spanish Ministry of Science and Innovation [PAT-MED PID2019-106942RB-C31]
  3. Basque Government [IXA IT-1343-19, ElkartekKK-2019/00045, PRE 2018 2 0265]

Ask authors/readers for more resources

This work focuses on extracting ADRs from EHRs in Spanish using a deep neural network approach. The system shows the ability to handle cross-hospital predictions and has a certain tolerance to external variations. However, errors in entity recognition can lead to a decrease in performance in the ADR detection stage.
Background and Objective: This work tackles the Adverse Drug Reaction (ADR) extraction in Electronic Health Records (EHRs) written in Spanish. This task is within the framework of natural language processing. It consists of extracting relations between drug-disease pairs, with the drug as the causing agent of the reaction. To this end, a pipeline is employed: first, relevant clinical entities are recognized (e.g. drugs, active ingredients, findings, symptoms); next, drug-disease candidate pairs are judged as either ADR or non-ADR. To develop this task, it is necessary to tackle some challenges. First, EHRs show high lexical variability. Second, EHRs are scarce due to their sensitive information. Third, the ADR detection stage has to cope with errors derived from the entity recognition. Methods: To develop the ADR detection we decided to employ a deep neural network approach. In order to asses the tolerance to external variations, the system was exposed to different levels of noise. First, with three corpora that contain documents from different hospitals, size and class imbalance ratio. Furthermore, it was exposed to cross-corpus relation extraction. Second, we assessed the sensitivity of the ADR detection stage to noise introduced by the automatic Medical Entity Recognition (MER). Results: The system can cope with cross-hospital predictions provided that it was trained with a large corpus. In the most challenging situation an f-measure of 75.2 was achieved. With respect to the tolerance to errors derived from the entity recognition step, with a medical entity recognizer that missed 20% of the entities, the f-measure in the ADR detection stage decreased to 68.6. Conclusions: The ADR extraction is tackled as a cause-effect relation extraction task between drugs and diseases. It is advisable to employ as many EHRs as possible in order to make more robust the ADR extraction. Despite the entities missed in the MER step, the drop in the performance is not high with the proposed system. (C) 2020 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available