4.5 Article

A simple Galois Power-of-Two real time embedding scheme for performing Arabic morphology deep learning tasks

Journal

EGYPTIAN INFORMATICS JOURNAL
Volume 22, Issue 1, Pages 35-43

Publisher

CAIRO UNIV, FAC COMPUTERS & INFORMATION
DOI: 10.1016/j.eij.2020.03.002

Keywords

Word embeddings; Deep learning; Arabic morphology; Galois power of two; Real-time embeddings; Parallel neural model graph

Funding

  1. EIAS Data Science & Blockchain Lab, Prince Sultan University

Ask authors/readers for more resources

GPOW2 is a real-time embedding scheme that computes multilevel embeddings on the fly, improving the performance and accuracy of the SWAM Arabic morphological engine.
This paper describes how a simple novel Galois Power-of-Two (GPOW2) real-time embedding scheme is used to improve the performance and accuracy of downstream NLP tasks. GPOW2 computes embeddings live on the fly (real time) in the context of target NLP tasks without the need for tabulated pre-embeddings. One excellent feature of the method is the ability to capture multilevel embeddings in the same pass. It simultaneously computes character, word and sentence embeddings on the fly. GPOW2 has been derived in the context of attempts to improve the performance of the SWAM Arabic morphological engine, which is a multipurpose tool that supports segmentation, classification, POS tagging, spell checking, word embeddings, sematic search, among other tasks. SWAM is a pattern-oriented algorithm that relies on morphological patterns and POS tagging to perform NLP tasks. The paper demonstrates how GPOW2 led to improvements in the accuracy of POS tagging and pattern matching, and accordingly the performance of the whole engine. The accuracy for pattern prediction is 99.47% and is 98.80% for POS tagging. (C) 2020 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Computers and Artificial Intelligence, Cairo University.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available