4.5 Article

Bengali Stop Word and Phrase Detection Mechanism

Journal

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING
Volume 45, Issue 4, Pages 3355-3368

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s13369-020-04388-8

Keywords

Stop phrase; Stop word; Natural language processing; Finite automaton; Text processing

Funding

  1. Deanship of Scientific Research at King Saud University [RG-1441-394]

Ask authors/readers for more resources

Though plenty of research works have been done on stop word/phrase detection, there is no work done on Bengali stop words and stop phrases. This research innovates the definition and classification of Bengali stop words and phrases and implements two approaches to identify them. First one is a corpus-based approach, while the second one is based on the finite-state automaton. Performance of both approaches is measured and compared. Result analysis shows that corpus-based method outperforms the finite-state automaton-based method. The corpus-based and finite-state automaton-based method shows 90% and 80% of accuracy, respectively, for stop word detection and 80% and 70% accuracy, respectively, for stop phrase detection.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available