4.6 Article

CODER: Knowledge-infused cross-lingual medical term embedding for term normalization

期刊

JOURNAL OF BIOMEDICAL INFORMATICS
卷 126, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2021.103983

关键词

Medical term normalization; Cross-lingual; Medical term representation; Knowledge graph embedding; Contrastive learning

资金

  1. Natural Science Foundation of Bei-jing Municipality [Z190024]
  2. National Natural Science Foundation of China [11801301]
  3. National Key R&D Program of China [2016YFC0901901]
  4. Tsinghua University Initiative Scientific Research Program

向作者/读者索取更多资源

This paper introduces knowledge-aware embedding, CODER, a critical tool for medical term normalization. By utilizing contrastive learning and a medical knowledge graph, CODER can extract semantic similarity and relatedness of medical concepts, which can be used for medical term normalization or feature extraction for machine learning.
Objective: This paper aims to propose knowledge-aware embedding, a critical tool for medical term normalization. Methods: We develop CODER (Cross-lingual knowledge-infused medical term embedding) via contrastive learning based on a medical knowledge graph (KG) named the Unified Medical Language System, and similarities are calculated utilizing both terms and relation triplets from the KG. Training with relations injects medical knowledge into embeddings and can potentially improve their performance as machine learning features. Results: We evaluate CODER based on zero-shot term normalization, semantic similarity, and relation classifi-cation benchmarks, and the results show that CODER outperforms various state-of-the-art biomedical word embeddings, concept embeddings, and contextual embeddings. Conclusion: CODER embeddings excellently reflect semantic similarity and relatedness of medical concepts. One can use CODER for embedding-based medical term normalization or to provide features for machine learning. Similar to other pretrained language models, CODER can also be fine-tuned for specific tasks. Codes and models are available at https://github.com/GanjinZero/CODER.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据