4.7 Article

Named Entity Aware Transfer Learning for Biomedical Factoid Question Answering

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2021.3079339

关键词

Task analysis; Biological system modeling; Knowledge discovery; Transfer learning; Bagging; Semantics; Natural language processing; Biomedical factoid question answering; transfer learning; name entity; question representation; ensemble

资金

  1. State Key Laboratory, Software Development Environment of China [SKLSDE-2021ZX-16]
  2. National Natural Science Foundation of China [61977003]

向作者/读者索取更多资源

Biomedical factoid question answering is a crucial task in biomedical question answering applications. This study proposes a framework that fine-tunes BioBERT with a named entity dataset to improve question answering performance. BiLSTM is applied to encode the question text for sentence-level information, and bagging is used to combine question and token level information for enhanced overall performance.
Biomedical factoid question answering is an important task in biomedical question answering applications. It has attracted much attention because of its reliability. In question answering systems, better representation of words is of great importance, and proper word embedding can significantly improve the performance of the system. With the success of pretrained models in general natural language processing tasks, pretrained models have been widely used in biomedical areas, and many pretrained model-based approaches have been proven effective in biomedical question-answering tasks. In addition to proper word embedding, name entities also provide important information for biomedical question answering. Inspired by the concept of transfer learning, in this study, we developed a mechanism to fine-tune BioBERT with a named entity dataset to improve the question answering performance. Furthermore, we applied BiLSTM to encode the question text to obtain sentence-level information. To better combine the question level and token level information, we use bagging to further improve the overall performance. The proposed framework was evaluated on BioASQ 6b and 7b datasets, and the results have shown that our proposed framework can outperform all baselines.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据