4.8 Article

A joint model for entity boundary detection and entity span recognition

出版社

ELSEVIER
DOI: 10.1016/j.jksuci.2022.08.016

关键词

Named entity recognition; Boundary detection; Negative samples; Neural networks

资金

  1. National Natural Science Foundation of China [62166007, 62066008, 62066007]
  2. Natural Science Foundation of Guizhou Province [[2022] ZK027]

向作者/读者索取更多资源

This paper presents a model for detecting entity boundaries and predicting entity candidates jointly. The model makes predictions based on gap representations between words, avoiding ambiguity when a token has multiple labels. Additionally, a neighborhood span proposal strategy is proposed to address the issue of data imbalance. The model achieves performance close to the state-of-the-art in ACE2005 and GENIA corpora.
Named entity recognition is a task to extract named entities with predefined entity types. Span classifi-cation is a popular method to support this task. It has the advantage to solve nested structures and make full use of token features in a span. The problem is that exhaustively enumerating and verifying all entity spans suffer from high computational complexity and data imbalance. Furthermore, spans with a high overlapping ratio share the same contextual features in a sentence, which is easy to lead to false positive errors caused by inaccurate entity boundaries. In this paper, we present a model to detect the entity boundaries and predict entity candidates jointly. Instead of labeling tokens, our model makes the predic-tion based on gap representations between words, which avoids the ambiguity when a token has several labels. We also propose a neighborhood span proposal strategy to generate reasonable negative samples for training, which effectively reduces the data imbalance problem. Our model is evaluated on the ACE2005 and GENIA corpora. It achieves performance close to the state-of-the-art in F1 scores of 88.55% and 79.81%, respectively.(c) 2022 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据