☆ 4.2 Article

SimiT: A Text Similarity Method Using Lexicon and Dependency Representations

NEW GENERATION COMPUTING (2020)

Journal

NEW GENERATION COMPUTING

Volume 38, Issue 3, Pages 509-530

Publisher

SPRINGER

DOI: 10.1007/s00354-020-00099-8

Keywords

Text similarity; Dependency parser embeddings; Lexicon embeddings

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Semantic textual similarity methods are becoming increasingly crucial in text mining research areas such as text retrieval and summarization. Existing methods of text similarity have often been computed by their shallow or syntactic representation rather than considering their semantic content and meanings. This paper focuses mainly on computing the similarity between sentences without a supervised learning approach, only considering their word-level coherence which is calculated by a hybrid method of dependency parser and lexicon embeddings. Hence, we concentrate on structural similarity between text pairs by regarding their dependency parser embeddings. Our hybrid method also pays attention to the semantic information of words implied in the sentences. In the evaluation, we compare our method with the state-of-the-art semantic similarity measures in a well-known dataset. Our method outperforms most of the studies in the literature and the overall performance achieves better results when combining the similarity scores of both embedding models.

SimiT: A Text Similarity Method Using Lexicon and Dependency Representations

Journal

NEW GENERATION COMPUTING

Publisher

SPRINGER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

SimiT: A Text Similarity Method Using Lexicon and Dependency Representations

Journal

NEW GENERATION COMPUTING

Publisher

SPRINGER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper