4.5 Article

A BERT-Based Hybrid Short Text Classification Model Incorporating CNN and Attention-Based BiGRU

Journal

Publisher

IGI GLOBAL
DOI: 10.4018/JOEUC.294580

Keywords

Deep Learning; Fusion Framework; Natural Language Processing; Short Text Classification

Funding

  1. National Social Science Foundation of China [19BTQ032]

Ask authors/readers for more resources

The paper proposes a feature fusion framework based on BERT, where CNN captures static features, BiGRU captures contextual features, and an attention mechanism assigns weight to salient words. Experimental results show that the model outperforms other state-of-the-art baseline methods.
Short text classification is a research focus for natural language processing (NLP), which is widely used in news classification, sentiment analysis, mail filtering, and other fields. In recent years, deep learning techniques are applied to text classification and have made some progress. Different from ordinary text classification, short text has the problem of less vocabulary and feature sparsity, which raise higher request for text semantic feature representation. To address this issue, this paper proposes a feature fusion framework based on the bidirectional encoder representations from transformers (BERT). In this hybrid method, BERT is used to train word vector representation. Convolutional neural network (CNN) captures static features. As a supplement, a bi-gated recurrent neural network (BiGRU) is adopted to capture contextual features. Furthermore, an attention mechanism is introduced to assign the weight of salient words. The experimental results confirmed that the proposed model significantly outperforms the other state-of-the-art baseline methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available