4.1 Article

Ensemble Deep Learning for Multilabel Binary Classification of User-Generated Content

Journal

ALGORITHMS
Volume 13, Issue 4, Pages -

Publisher

MDPI
DOI: 10.3390/a13040083

Keywords

ensemble learning; sentiment analysis; multilabel classification; deep neural networks; pure emotion; Semeval 2018 Task 1; toxic comment classification

Funding

  1. Engineering and Physical Sciences Research Council [EP/M02315X/1]
  2. EPSRC [EP/M02315X/1] Funding Source: UKRI

Ask authors/readers for more resources

Sentiment analysis usually refers to the analysis of human-generated content via a polarity filter. Affective computing deals with the exact emotions conveyed through information. Emotional information most frequently cannot be accurately described by a single emotion class. Multilabel classifiers can categorize human-generated content in multiple emotional classes. Ensemble learning can improve the statistical, computational and representation aspects of such classifiers. We present a baseline stacked ensemble and propose a weighted ensemble. Our proposed weighted ensemble can use multiple classifiers to improve classification results without hyperparameter tuning or data overfitting. We evaluate our ensemble models with two datasets. The first dataset is from Semeval2018-Task 1 and contains almost 7000 Tweets, labeled with 11 sentiment classes. The second dataset is the Toxic Comment Dataset with more than 150,000 comments, labeled with six different levels of abuse or harassment. Our results suggest that ensemble learning improves classification results by 1.5% to 5.4%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available