☆ 4.6 Article

Hierarchical label-wise attention transformer model for explainable ICD coding

JOURNAL OF BIOMEDICAL INFORMATICS (2022)

Journal

JOURNAL OF BIOMEDICAL INFORMATICS

Volume 133, Issue -, Pages -

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.jbi.2022.104161

Keywords

Hierarchical label-wise attention; Transformers; Explainability; ICD coding; MIMIC-III

Funding

Australian government
Commonwealth Industrial and Scientific Research Organisation (CSIRO)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

In this study, a hierarchical label-wise attention Transformer model (HiLAT) is proposed for the explainable prediction of ICD codes from clinical documents. The model utilizes a pretrained Transformer model and a hierarchical label-wise attention mechanism to predict the assignment of specific ICD codes to clinical documents. Experimental results show that HiLAT + ClinicalplusXLNet outperforms previous state-of-the-art models for predicting ICD-9 codes, and attention weight visualizations serve as a potential tool for validating ICD code predictions.

International Classification of Diseases (ICD) coding plays an important role in systematically classifying morbidity and mortality data. In this study, we propose a hierarchical label-wise attention Transformer model (HiLAT) for the explainable prediction of ICD codes from clinical documents. HiLAT firstly fine-tunes a pretrained Transformer model to represent the tokens of clinical documents. We subsequently employ a two-level hierarchical label-wise attention mechanism that creates label-specific document representations. These representations are in turn used by a feed-forward neural network to predict whether a specific ICD code is assigned to the input clinical document of interest. We evaluate HiLAT using hospital discharge summaries and their corresponding ICD-9 codes from the MIMIC-III database. To investigate the performance of different types of Transformer models, we develop ClinicalplusXLNet, which conducts continual pretraining from XLNet-Base using all the MIMIC-III clinical notes. The experiment results show that the F1 scores of the HiLAT + ClinicalplusXLNet outperform the previous state-of-the-art models for the top-50 most frequent ICD-9 codes from MIMIC-III. Visualisations of attention weights present a potential explainability tool for checking the face validity of ICD code predictions.

Hierarchical label-wise attention transformer model for explainable ICD coding

Journal

JOURNAL OF BIOMEDICAL INFORMATICS

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Hierarchical label-wise attention transformer model for explainable ICD coding

Journal

JOURNAL OF BIOMEDICAL INFORMATICS

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper