Article; Proceedings Paper

Computing inter-document similarity with Context Semantic Analysis

INFORMATION SYSTEMS (2019)

Journal

INFORMATION SYSTEMS

Volume 80, Pages 136-147

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.is.2018.02.009

Keywords

Knowledge base; Knowledge graph; Inter-document similarity; Similarity measures; Information Retrieval

Abstract

We propose a novel knowledge-based technique for inter-document similarity computation, called Context Semantic Analysis (CSA). Several specialized approaches built on top of a specific knowledge base (e.g. Wikipedia) exist in the literature, but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia or Wikidata) to extract a Semantic Context Vector, a novel model for representing the context of a document, which CSA exploits to compute inter-document similarity. Moreover, we show how CSA can be applied effectively in the Information Retrieval domain. Experimental results show that: (i) for the general task of inter-document similarity, CSA outperforms baselines built on top of traditional methods and achieves performance similar to that of approaches built on top of specific knowledge bases; (ii) for Information Retrieval tasks, enriching documents with context (i.e., employing the Semantic Context Vector model) improves the result quality of the state-of-the-art technique that employs a similar semantic enrichment. © 2018 Elsevier Ltd. All rights reserved.
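
The abstract does not say how a Semantic Context Vector is built or how two such vectors are compared. As a purely illustrative sketch, assume each document has already been mapped to a sparse vector whose keys are knowledge-base concept identifiers and whose values are weights, and assume cosine similarity as the comparison. Both are assumptions made here, not details confirmed by the abstract, and the concept names and weights below are made up.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    # Only concepts present in both vectors contribute to the dot product.
    dot = sum(weight * v[key] for key, weight in u.items() if key in v)
    norm_u = math.sqrt(sum(weight * weight for weight in u.values()))
    norm_v = math.sqrt(sum(weight * weight for weight in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_u * norm_v)


# Hypothetical context vectors: keys stand in for RDF concept identifiers,
# weights are invented for illustration only.
doc_a = {"dbr:Information_retrieval": 0.8, "dbr:Semantic_Web": 0.6}
doc_b = {"dbr:Information_retrieval": 0.6, "dbr:Database": 0.8}

similarity = cosine_similarity(doc_a, doc_b)
```

With these made-up vectors only one concept is shared, so the score is driven entirely by that overlap. A real pipeline would derive the vectors from the knowledge base rather than write them by hand.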
