4.7 Article

Arabic sentiment analysis using GCL-based architectures and a customized regularization function

出版社

ELSEVIER - DIVISION REED ELSEVIER INDIA PVT LTD
DOI: 10.1016/j.jestch.2023.101433

关键词

Arabic sentiment analysis (ASA); Natural language processing (NLP); Custom regularization function (CRF); Gated convolution long (GCL)

向作者/读者索取更多资源

Sentiment analysis aims to extract emotions from textual data, and various challenges have emerged due to the proliferation of social media platforms and the flow of data in the Arabic language. This paper introduces Gated Convolution Long (GCL), an architecture designed for Arabic Sentiment Analysis, which overcomes difficulties with lengthy sequence training samples and improves performance for binary and multiple classifications. The proposed method achieves better results than baselines in various Arabic datasets, and includes a Custom Regularization Function (CRF) that enhances performance and optimizes validation loss. Furthermore, the paper explores the relationship between Modern Standard Arabic and five Arabic dialects through a cross-dialect training study.
Sentiment analysis aims to extract emotions from textual data; with the proliferation of various social media platforms and the flow of data, particularly in the Arabic language, significant challenges have arisen, necessitating the development of various frameworks to handle issues. In this paper, we firstly design an architecture called Gated Convolution Long (GCL) to perform Arabic Sentiment Analysis. GCL can overcome difficulties with lengthy sequence training samples, extracting the optimal features that help improve Arabic sentiment analysis performance for binary and multiple classifications. The proposed method trains and tests in various Arabic datasets; The results are better than the baselines in all cases. GCL includes a Custom Regularization Function (CRF), which improves the performance and optimizes the validation loss. We carry out an ablation study and investigate the effect of removing CRF. CRF is shown to make a difference of up to 5.10% (2C) and 4.12% (3C). Furthermore, we study the relationship between Modern Standard Arabic and five Arabic dialects via a cross-dialect training study. Finally, we apply GCL through standard regularization (GCL+L1, GCL+L2, and GCL+LElasticNet) and our Lnew on two big Arabic sentiment datasets; GCL+Lnew gave the highest results (92.53%) with less performance time.& COPY; 2023 Karabuk University. Publishing services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据