Journal
MULTIMEDIA TOOLS AND APPLICATIONS
Volume 81, Issue 26, Pages 37747-37761
Publisher
SPRINGER
DOI: 10.1007/s11042-022-12657-x
Keywords
Keyword extraction; Feature extraction
In this work, a statistical approach to identify unigram keywords for a document is proposed. The approach does not require pre-training of the model and evaluates terms using relative entropy, displacement, and variance, comparing their effectiveness with term frequency.
In this work we propose a statistical approach to identify unigram keywords for a document. We identify features that effectively capture the importance of a word in a document and evaluate its potential to be a keyword. The relative entropy, displacement, and variance of terms in a document are evaluated in the context of keyword identification. The proposed approach works on single documents and does not require any pre-training of a model. We also evaluate the effectiveness of the proposed feature set against the gold standard of term frequency. The results of the proposed method are presented and compared with existing algorithms.
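The abstract names the features but does not define them, so the following is only a minimal sketch of how per-term statistics of this kind might be computed from a single document with no training step. Every definition here is an assumption made for illustration, not the authors' formulation: "variance" is taken as the variance of a term's normalized positions, "displacement" as the normalized span between its first and last occurrence, and "relative entropy" as the KL divergence between the term's distribution over equal-width segments and a uniform distribution. The function name `term_features` and the parameter `n_segments` are hypothetical.

```python
import math
import re
from collections import defaultdict


def term_features(text, n_segments=4):
    """Per-term statistics for one document (illustrative definitions only)."""
    tokens = re.findall(r"[a-z]+", text.lower())
    n = len(tokens)

    # Record every position at which each unigram occurs.
    positions = defaultdict(list)
    for i, tok in enumerate(tokens):
        positions[tok].append(i)

    features = {}
    for term, pos in positions.items():
        count = len(pos)

        # Term frequency: the baseline the abstract compares against.
        tf = count / n

        # Assumed "variance": variance of the term's normalized positions.
        norm = [p / n for p in pos]
        mean = sum(norm) / count
        variance = sum((x - mean) ** 2 for x in norm) / count

        # Assumed "displacement": normalized span from first to last occurrence.
        displacement = (pos[-1] - pos[0]) / n

        # Assumed "relative entropy": KL divergence between the term's
        # distribution over equal-width segments and the uniform distribution.
        seg_counts = [0] * n_segments
        for p in pos:
            seg_counts[min(p * n_segments // n, n_segments - 1)] += 1
        relative_entropy = sum(
            (c / count) * math.log((c / count) * n_segments)
            for c in seg_counts
            if c
        )

        features[term] = {
            "tf": tf,
            "variance": variance,
            "displacement": displacement,
            "relative_entropy": relative_entropy,
        }
    return features
```

Calling `term_features(document_text)` returns a dictionary keyed by unigram, which could then be ranked by any one feature or by a combination. How the paper actually combines or thresholds these features is not stated in the abstract.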