Article

A Pseudo-relevance feedback framework combining relevance matching and semantic matching for information retrieval

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 57, Issue 6, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2020.102342

Keywords

Information retrieval; Pseudo-relevance feedback; Text similarity; Semantic matching

Funding

  1. National Natural Science Foundation of China [61572223]
  2. National Key Research and Development Program of China [2017YFC0909502]
  3. Wuhan Science and Technology Program [2019010701011392]
  4. Innovation team of the basic intelligent education service innovation model and technology research, Hubei Normal University

Pseudo-relevance feedback (PRF) is a well-known method for addressing the mismatch between query intention and query representation. Most current PRF methods sort feedback documents by term-based relevance matching alone, which can leave a semantic gap between the query representation and the document representation. In this work, a PRF framework that combines relevance matching and semantic matching is proposed to improve the quality of feedback documents. Specifically, in the first round of retrieval, we propose a reranking mechanism in which exact-term information and the semantic similarity between the query and document representations are calculated using bidirectional encoder representations from transformers (BERT); by using semantic information, this mechanism reduces the semantic gap between texts and improves the quality of feedback documents. The proposed PRF framework then processes the results of the first round of retrieval using probability-based and language-model-based PRF methods. Finally, we conduct extensive experiments on four Text Retrieval Conference (TREC) datasets. The results show that the proposed models outperform strong baseline models in terms of mean average precision (MAP) and precision at position 10 (P@10). They also show that combining relevance matching and semantic matching improves the quality of feedback documents more than using either one alone.
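
To make the reranking idea in the abstract concrete, the following is a minimal sketch of selecting feedback documents by combining a term-based relevance score with an embedding-based semantic similarity. It is not the authors' implementation. The abstract does not state how the two signals are combined, so the linear interpolation weight `alpha`, the min-max normalization, and the function names are all assumptions made for illustration. The plain Python lists stand in for first-round retrieval scores and for BERT-derived query and document vectors.

```python
import math


def min_max(scores):
    """Rescale scores to [0, 1]; an assumed normalization, not from the paper."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def cosine(u, v):
    """Cosine similarity between two dense vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def select_feedback_docs(doc_ids, term_scores, query_vec, doc_vecs, alpha=0.5, k=2):
    """Rerank first-round results and keep the top-k as feedback documents.

    term_scores: exact-term relevance scores from the first retrieval round.
    query_vec, doc_vecs: placeholder embeddings (hypothetical stand-ins for
    BERT representations).
    alpha: assumed interpolation weight; 1.0 uses relevance matching only,
    0.0 uses semantic matching only.
    """
    relevance = min_max(term_scores)
    semantic = min_max([cosine(query_vec, d) for d in doc_vecs])
    combined = [alpha * r + (1.0 - alpha) * s for r, s in zip(relevance, semantic)]
    ranked = sorted(zip(doc_ids, combined), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Toy data: three first-round documents with made-up scores and vectors.
    ids = ["d1", "d2", "d3"]
    scores = [12.0, 9.0, 3.0]
    q = [1.0, 0.0]
    docs = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
    for doc_id, score in select_feedback_docs(ids, scores, q, docs):
        print(doc_id, round(score, 3))
```

In this sketch, the selected documents would then be passed to whichever PRF model performs query expansion. Changing `alpha` shows how the feedback set shifts between purely term-based and purely semantic selection.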
