3.8 Proceedings Paper

Stemming and Lemmatization for Information Retrieval Systems in Amazigh Language

Journal

BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018
Volume 872, Issue -, Pages 222-233

Publisher

SPRINGER-VERLAG BERLIN
DOI: 10.1007/978-3-319-96292-4_18

Keywords

Search engine; HMM; Lemmatization; Stemming; Machine learning

Ask authors/readers for more resources

Stemming and lemmatization are two language modeling techniques used to improve the document retrieval precision performances. Stemming is a procedure to reduce all words with the same stem to a common form whereas lemmatization removes inflectional endings and returns the base form of a word. The idea of this paper is to explain how a stemming or lemmatization in Amazigh language can improve the search outcomes by providing results that fit better with the query the user introduced. In Document retrieval systems, lemmatization produced better precision compared to stemming. Overall the findings suggest that language modeling techniques improves document retrieval, with lemmatization technique producing the best result.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available