4.7 Article

Arabic sentiment analysis using GCL-based architectures and a customized regularization function

Publisher

ELSEVIER - DIVISION REED ELSEVIER INDIA PVT LTD
DOI: 10.1016/j.jestch.2023.101433

Keywords

Arabic sentiment analysis (ASA); Natural language processing (NLP); Custom regularization function (CRF); Gated convolution long (GCL)

Ask authors/readers for more resources

Sentiment analysis aims to extract emotions from textual data, and various challenges have emerged due to the proliferation of social media platforms and the flow of data in the Arabic language. This paper introduces Gated Convolution Long (GCL), an architecture designed for Arabic Sentiment Analysis, which overcomes difficulties with lengthy sequence training samples and improves performance for binary and multiple classifications. The proposed method achieves better results than baselines in various Arabic datasets, and includes a Custom Regularization Function (CRF) that enhances performance and optimizes validation loss. Furthermore, the paper explores the relationship between Modern Standard Arabic and five Arabic dialects through a cross-dialect training study.
Sentiment analysis aims to extract emotions from textual data; with the proliferation of various social media platforms and the flow of data, particularly in the Arabic language, significant challenges have arisen, necessitating the development of various frameworks to handle issues. In this paper, we firstly design an architecture called Gated Convolution Long (GCL) to perform Arabic Sentiment Analysis. GCL can overcome difficulties with lengthy sequence training samples, extracting the optimal features that help improve Arabic sentiment analysis performance for binary and multiple classifications. The proposed method trains and tests in various Arabic datasets; The results are better than the baselines in all cases. GCL includes a Custom Regularization Function (CRF), which improves the performance and optimizes the validation loss. We carry out an ablation study and investigate the effect of removing CRF. CRF is shown to make a difference of up to 5.10% (2C) and 4.12% (3C). Furthermore, we study the relationship between Modern Standard Arabic and five Arabic dialects via a cross-dialect training study. Finally, we apply GCL through standard regularization (GCL+L1, GCL+L2, and GCL+LElasticNet) and our Lnew on two big Arabic sentiment datasets; GCL+Lnew gave the highest results (92.53%) with less performance time.& COPY; 2023 Karabuk University. Publishing services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available