4.5 Article

Assisting Multimodal Named Entity Recognition by cross-modal auxiliary tasks

Journal

PATTERN RECOGNITION LETTERS
Volume 175, Issue -, Pages 52-58

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2023.10.004

Keywords

Multimodal named entity recognition; Multi-task learning; Cross-modal learning

Ask authors/readers for more resources

This paper introduces a method for improving the performance of Multimodal Named Entity Recognition (MNER) through cross-modal auxiliary tasks. The method utilizes cross-modal matching and cross-modal mutual information maximization to address the issue of mismatched image-text pairs, and separates the features of the main task and auxiliary tasks through a cross-modal gate-control mechanism.
Although the existing Multimodal Named Entity Recognition (MNER) methods have achieved promising performance, they suffer from the following drawbacks in social media scenarios. Firstly, most existing methods are based on a strong assumption that the textual content and the associated images are matched, which is not always valid in real scenarios; Secondly, current methods fail to filter out modality-specific random noise, which impedes models from exploiting modality-shared features. In this paper, a novel multi-task multimodal learning architecture is put forward, which aims to improve Multimodal Named Entity Recognition (MNER) performance by cross-modal auxiliary tasks (CMAT). Specifically, we first separate the shared and task-specific features for the main task and auxiliary tasks respectively, which is accomplished by cross-modal gate-control mechanism. Subsequently, without extra pre-processing or annotations, we utilize the cross-modal matching to address the issue of mismatched image-text pairs, and the cross-modal mutual information maximization to optimize the most relevant cross-modal features. Moreover, experimental results on the two widely used datasets confirm the superiority of our proposed approach.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available