☆ 4.7 Article

Automation of penicillin adverse drug reaction categorisation and risk stratification with machine learning natural language processing

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS (2021)

Journal

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS

Volume 156, Issue -, Pages -

Publisher

ELSEVIER IRELAND LTD

DOI: 10.1016/j.ijmedinf.2021.104611

Keywords

Penicillin; Adverse drug reaction; Electronic health records; Machine learning; Natural language processing

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study utilized machine learning natural language processing to classify and assess the risk of penicillin ADRs with high accuracy. The artificial neural network model performed best in categorizing allergy or intolerance and evaluating true allergy risk, outperforming simpler algorithms and achieving high sensitivity and specificity. The findings suggest that automated evaluation using machine learning models may improve the specificity of prescribing alerts and facilitate delabelling in real-time risk stratification.

Background: The penicillin adverse drug reaction (ADR) label is common in electronic health records (EHRs). However, there is significant misclassification between allergy and intolerance within the EHR and most patients can be delabelled after an immunologic assessment. Machine learning natural language processing may be able to assist with the categorisation and risk stratification of penicillin ADRs. Objective: The aim of this study was to use text entered into an EHR to derive and evaluate machine learning models to classify penicillin ADRs and assess the risk of true allergy. Methods: Machine learning natural language processing was applied to free-text penicillin ADR data extracted from a public health system EHR. The model was developed by training on labelled dataset. ADR entries were split into training and testing datasets and used to develop and test a variety of machine learning models. These were compared to categorisation with a simple algorithm using keyword search. Results: The best performing model for the classification of penicillin ADRs as being consistent with allergy or intolerance was the artificial neural network (AUC 0.994, sensitivity 0.99, specificity 0.96). The artificial neural network also achieved the highest AUC in the classification of high- or low-risk of true allergy (AUC 0.988, sensitivity 0.99, specificity 0.99). All ADR labels were able to be classified using these machine learning models, whereas a small proportion were unclassifiable using the simple algorithm as they contained no keywords. Conclusion: Machine learning natural language processing performed similarly to expert criteria in classifying and risk stratifying penicillin ADRs labels. These models outperformed simpler algorithms in their ability to interpret free-text data contained in the EHR. The automated evaluation of penicillin ADR labels may allow realtime risk stratification to facilitate delabelling and improve the specificity of prescribing alerts.

Automation of penicillin adverse drug reaction categorisation and risk stratification with machine learning natural language processing

Journal

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS

Publisher

ELSEVIER IRELAND LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Automation of penicillin adverse drug reaction categorisation and risk stratification with machine learning natural language processing

Journal

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS

Publisher

ELSEVIER IRELAND LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper