☆ 4.6 Article

Designing an efficient unigram keyword detector for documents using Relative Entropy

MULTIMEDIA TOOLS AND APPLICATIONS (2022)

期刊

MULTIMEDIA TOOLS AND APPLICATIONS

卷 81, 期 26, 页码 37747-37761

出版社

SPRINGER

DOI: 10.1007/s11042-022-12657-x

关键词

Keyword extraction; Feature extraction

类别

Computer Science, Information Systems Computer Science, Software Engineering Computer Science, Theory & Methods Engineering, Electrical & Electronic

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this work, a statistical approach to identify unigram keywords for a document is proposed. The approach does not require pre-training of the model and evaluates terms using relative entropy, displacement, and variance, comparing their effectiveness with term frequency.

In this work we propose a statistical approach to identify unigram keywords for a document. We identify unigram keywords as features which effectively captures the importance of a word in a document and evaluates its potential to be a keyword. We make use of relative entropy, displacement and variance of terms in a document have been evaluated in the context of keyword identification. The proposed approach works on single documents without the requirement of any pre-training of the model. We also evaluate the effectiveness of our features against the gold standard of term frequency and compare the usefulness of the proposed feature set with term frequency. The results of our proposed method are presented and compared with existing algorithms.

Designing an efficient unigram keyword detector for documents using Relative Entropy

期刊

MULTIMEDIA TOOLS AND APPLICATIONS

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Designing an efficient unigram keyword detector for documents using Relative Entropy

期刊

MULTIMEDIA TOOLS AND APPLICATIONS

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文