4.6 Article

Designing an efficient unigram keyword detector for documents using Relative Entropy

Journal

MULTIMEDIA TOOLS AND APPLICATIONS
Volume 81, Issue 26, Pages 37747-37761

Publisher

SPRINGER
DOI: 10.1007/s11042-022-12657-x

Keywords

Keyword extraction; Feature extraction

Ask authors/readers for more resources

In this work, a statistical approach to identify unigram keywords for a document is proposed. The approach does not require pre-training of the model and evaluates terms using relative entropy, displacement, and variance, comparing their effectiveness with term frequency.
In this work we propose a statistical approach to identify unigram keywords for a document. We identify unigram keywords as features which effectively captures the importance of a word in a document and evaluates its potential to be a keyword. We make use of relative entropy, displacement and variance of terms in a document have been evaluated in the context of keyword identification. The proposed approach works on single documents without the requirement of any pre-training of the model. We also evaluate the effectiveness of our features against the gold standard of term frequency and compare the usefulness of the proposed feature set with term frequency. The results of our proposed method are presented and compared with existing algorithms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available