4.6 Article

Designing an efficient unigram keyword detector for documents using Relative Entropy

期刊

MULTIMEDIA TOOLS AND APPLICATIONS
卷 81, 期 26, 页码 37747-37761

出版社

SPRINGER
DOI: 10.1007/s11042-022-12657-x

关键词

Keyword extraction; Feature extraction

向作者/读者索取更多资源

In this work, a statistical approach to identify unigram keywords for a document is proposed. The approach does not require pre-training of the model and evaluates terms using relative entropy, displacement, and variance, comparing their effectiveness with term frequency.
In this work we propose a statistical approach to identify unigram keywords for a document. We identify unigram keywords as features which effectively captures the importance of a word in a document and evaluates its potential to be a keyword. We make use of relative entropy, displacement and variance of terms in a document have been evaluated in the context of keyword identification. The proposed approach works on single documents without the requirement of any pre-training of the model. We also evaluate the effectiveness of our features against the gold standard of term frequency and compare the usefulness of the proposed feature set with term frequency. The results of our proposed method are presented and compared with existing algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据