☆ 4.2 Article

SimiT: A Text Similarity Method Using Lexicon and Dependency Representations

NEW GENERATION COMPUTING (2020)

期刊

NEW GENERATION COMPUTING

卷 38, 期 3, 页码 509-530

出版社

SPRINGER

DOI: 10.1007/s00354-020-00099-8

关键词

Text similarity; Dependency parser embeddings; Lexicon embeddings

类别

Computer Science, Hardware & Architecture Computer Science, Theory & Methods

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Semantic textual similarity methods are becoming increasingly crucial in text mining research areas such as text retrieval and summarization. Existing methods of text similarity have often been computed by their shallow or syntactic representation rather than considering their semantic content and meanings. This paper focuses mainly on computing the similarity between sentences without a supervised learning approach, only considering their word-level coherence which is calculated by a hybrid method of dependency parser and lexicon embeddings. Hence, we concentrate on structural similarity between text pairs by regarding their dependency parser embeddings. Our hybrid method also pays attention to the semantic information of words implied in the sentences. In the evaluation, we compare our method with the state-of-the-art semantic similarity measures in a well-known dataset. Our method outperforms most of the studies in the literature and the overall performance achieves better results when combining the similarity scores of both embedding models.

SimiT: A Text Similarity Method Using Lexicon and Dependency Representations

期刊

NEW GENERATION COMPUTING

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

SimiT: A Text Similarity Method Using Lexicon and Dependency Representations

期刊

NEW GENERATION COMPUTING

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文