4.5 Article

Enhancing Arabic-text feature extraction utilizing label-semantic augmentation in few/zero-shot learning

期刊

EXPERT SYSTEMS
卷 40, 期 8, 页码 -

出版社

WILEY
DOI: 10.1111/exsy.13329

关键词

Arabic text classification; contextual embeddings; feature extraction; few; zero-shot learning; label semantics

向作者/读者索取更多资源

An increasing number of studies are using pre-trained language models to tackle few/zero-shot text classification problems. However, most of these studies fail to consider the semantic information embedded in the natural language class labels. This work demonstrates how label information can be leveraged to enhance feature representation in input texts, particularly in scenarios with scarce data resources and short texts lacking semantic information like tweets. The study also shows the effectiveness of zero-shot implementation in predicting new classes across different domains, achieving high accuracy in Arabic sarcasm detection.
A growing amount of research use pre-trained language models to address few/zero-shot text classification problems. Most of these studies neglect the semantic information hidden implicitly beneath the natural language names of class labels and develop a meta learner from the input texts solely. In this work, we demonstrate how label information can be utilized to extract enhanced feature representation of the input text from a Transformer-based pre-trained language model such as AraBERT. In addition, how this approach can improve performance when the data resources are scarce like in the Arabic language and the input text is short with little semantic information as is the case using tweets. The work also applies zero-shot text classification to predict new classes with no training examples across different domains including sarcasm detection and sentiment analysis using the information in the last layer of a trained classifier in a transfer learning setting. Experiments show that our approach has a better performance for the few-shot sentiment classification compared to baseline models and models trained without augmenting label information. Moreover, the zero-shot implementation achieved an accuracy up to 0.874 in Arabic sarcasm detection from a model trained on a sentiment analysis task.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据