期刊
INFORMATION PROCESSING & MANAGEMENT
卷 58, 期 6, 页码 -出版社
ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2021.102738
关键词
Text matching; Deep learning; Deep interaction; Attention mechanism; Neural network
资金
- National Natural Science Foundation of China [71974202, 71921002, 71790612, 72174153]
- Ministry of Education of China [19YJC870029, 17JZD034]
- Fundamental Research Funds for the Central Universities [2722021AJ011]
A novel Deep Interactive Text Matching (DITM) model is proposed in this study, which effectively captures the interactive information between text pairs and has high generalization ability among different tasks.
In recent years, text matching has gained increasing research focus and shown great improvements. However, due to the long-distance dependency and polysemy, existing text matching models cannot effectively capture the contextual and implicit semantic information of texts. Additionally, existing models are lack of generalization ability when applied to different scenarios. In this study, we propose a novel Deep Interactive Text Matching (DITM) model by integrating the encoder layer, the co-attention layer, and the fusion layer as an interaction module, based on a matching-aggregation framework. In particular, the interaction process is iterated multiple times to obtain the in-depth interaction information, and the relationship between the text pair is extracted through the multi-perspective pooling. We conduct extensive experiments on four text matching tasks, i.e., opinion retrieval, answer selection, paraphrase identification and natural language inference. Compared with the state-of-the-art text matching methods, the proposed model achieved the best results on most of the tasks, which proves that our model could effectively capture the interactive information between text pairs, and has a high generalization ability among different tasks. Further multi-lingual investigations show the similarities of the performance between English and Chinese, which suggest that our model could be ported to other languages. The research contributes a simple and efficient implementation of text matching in a situation where there is limited computing capacity, and sheds light on leveraging text matching models to facilitate a range of downstream tasks.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据