3.8 Proceedings Paper

Stemming and Lemmatization for Information Retrieval Systems in Amazigh Language

期刊

BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018
卷 872, 期 -, 页码 222-233

出版社

SPRINGER-VERLAG BERLIN
DOI: 10.1007/978-3-319-96292-4_18

关键词

Search engine; HMM; Lemmatization; Stemming; Machine learning

向作者/读者索取更多资源

Stemming and lemmatization are two language modeling techniques used to improve the document retrieval precision performances. Stemming is a procedure to reduce all words with the same stem to a common form whereas lemmatization removes inflectional endings and returns the base form of a word. The idea of this paper is to explain how a stemming or lemmatization in Amazigh language can improve the search outcomes by providing results that fit better with the query the user introduced. In Document retrieval systems, lemmatization produced better precision compared to stemming. Overall the findings suggest that language modeling techniques improves document retrieval, with lemmatization technique producing the best result.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据