☆ 4.5 Article

DuCL: Dual-stage contrastive learning framework for Chinese semantic textual matching

COMPUTERS & ELECTRICAL ENGINEERING (2023)

期刊

COMPUTERS & ELECTRICAL ENGINEERING

卷 106, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.compeleceng.2022.108574

关键词

Semantic textual matching; Contrastive learning; Sentence-level representation; Pair-level representation

类别

Computer Science, Hardware & Architecture Computer Science, Interdisciplinary Applications Engineering, Electrical & Electronic

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Chinese semantic textual matching is a challenging task in NLP, and accurately capturing both intra-text and inter-text features is crucial. Existing methods usually only utilize contrastive learning at a single perspective, leading to suboptimal performance. To address this, we propose a dual-stage contrastive learning framework (DuCL) for Chinese textual matching, which incorporates a block-enhanced interaction module to generate a semantic matching representation. Experimental results on real-world datasets demonstrate the superiority of our method over representative and state-of-the-art methods.

Chinese semantic textual matching is a fundamental yet challenging task in natural language processing (NLP). How to accurately capture the features in a single piece of text and the interactive features between pieces of text is the core problem of the task. Although pretrained language models (PLMs) and contrastive learning (CL) have been applied to address the problem to some extent, the existing works usually just utilize contrastive learning to finetune the PLMs on one single perspective, such as the sentence or pair level, which neglects to capture the semantic features from the other perspective, leading to inefficient learning and suboptimal performance. To tackle the problem, we propose a novel dual-stage contrastive learning framework (DuCL) for Chinese semantic textual matching. Specifically, DuCL consists of two stages sequentially, i.e., CL on the sentence level and CL on the pair level, each of which is responsible to finetune PLMs from the corresponding perspective. Besides, DuCL introduces a block-enhanced interaction module to integrate token-level and block-level interactive features to generate a semantic matching representation for two pieces of text. Extensive experimental results on two real-world public datasets demonstrate that our method can achieve better performance than the representative and state-of-the-art methods.

DuCL: Dual-stage contrastive learning framework for Chinese semantic textual matching

期刊

COMPUTERS & ELECTRICAL ENGINEERING

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

DuCL: Dual-stage contrastive learning framework for Chinese semantic textual matching

期刊

COMPUTERS & ELECTRICAL ENGINEERING

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文