4.6 Article

A hybrid medical text classification framework: Integrating attentive rule construction and neural network

期刊

NEUROCOMPUTING
卷 443, 期 -, 页码 345-355

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2021.02.069

关键词

Hybrid system; Deep learning; Attention mechanism; Text classification

资金

  1. National Natural Science Foundation of China [71471092]
  2. Natural Science Foundation of Zhejiang Province [LR17G010001]
  3. Ningbo Municipal Bureau of Science and Technology [2017D10034, 2019B10026, 2019F1028]

向作者/读者索取更多资源

The main goal of this study is to enhance the quality and transparency of medical text classification solutions by proposing a three-stage hybrid method combining bi-directional Long Short-Term Memory and regular expression classifier. Experimental results demonstrate the superiority of the proposed approach in selecting domain specific and topic-related features, achieving an accuracy of 0.89 and an F1-score of 0.92. Furthermore, the versatility of regular expressions as a user-level tool for interpretable solutions and human modification is also highlighted.
The main objective of this work is to improve the quality and transparency of the medical text classification solutions. Conventional text classification methods provide users with only a restricted mechanism (based on frequency) for selecting features. In this paper, a three-stage hybrid method combining the gated attention-based bi-directional Long Short-Term Memory (ABLSTM) and the regular expression based classifier is proposed for medical text classification tasks. The bi-directional Long Short-Term Memory (LSTM) architecture with an attention layer allows the network to weigh words according to their perceived importance and focus on crucial parts of a sentence. Feature words (or keywords) extracted by ABLSTM model are utilized to guide the regular expression rule construction. Our proposed approach leverages the advantages of both the interpretability of rule-based algorithms and the computational power of deep learning approaches for a production-ready scenario. Experimental results on real world medical online query data clearly validate the superiority of our system in selecting domain specific and topic-related features. Results show that the proposed approach achieves an accuracy of 0.89 and an F-1-score of 0.92 respectively. Furthermore, our experimentation also illustrates the versatility of regular expressions as a user-level tool for focusing on desired patterns and providing interpretable solutions for human modification. (C) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据