4.2 Article

Deep-Cov19-Hate: A Textual-Based Novel Approach for Automatic Detection of Hate Speech in Online Social Networks throughout COVID-19 with Shallow and Deep Learning Models

期刊

TEHNICKI VJESNIK-TECHNICAL GAZETTE
卷 29, 期 1, 页码 149-156

出版社

UNIV OSIJEK, TECH FAC
DOI: 10.17559/TV-20210708143535

关键词

COVID-19; hate speech detection; shallow and deep learning models; social media analysis; social network problems

向作者/读者索取更多资源

The increasing use of various online social media platforms has led to the sharing of correct or incorrect information by users, particularly during COVID-19. This has resulted in a negative reaction towards East Asia (especially China) on these platforms, with social media users spreading degrading, racist, disrespectful, and offensive posts, accusing Asian people of being responsible for the outbreak of COVID-19. To address this issue, the development of a Hate Speech Detection (HSD) system was necessary to prevent the spread of such posts related to COVID-19. In this article, a textual-based study using Shallow Learning (SL) and Deep Learning (DL) methods was conducted to analyze and detect hate speech sharing in online social networks related to COVID-19.
The use of various online social media platforms rising day by day caused an increase in the correct or incorrect information shared by users, especially during COVID-19. The introduction of COVID-19 on the world agenda gave rise to an overall bad reaction against East Asia (esp. China) in online social media platforms. The social media users who spread degrading, racist, disrespectful, abusive, discriminatory, critical, abuse, harsh, offensive, etc. posts accused the Asian people of being responsible for the outbreak of COVID-19. For this reason, the development of the Hate Speech Detection (HSD) system was necessary in order to prevent the spread of these posts about COVID-19. In this article, a textual-based study on COVID-19-related hate speech (HS) sharing in online social networks was carried out with Shallow Learning (SL) and Deep Learning (DL) methods. In the first step of this study, typical Natural Language Processing (NLP) pipeline was applied for gathered two different datasets. This NLP pipeline was performed using bag of words, term frequency, document matrix, etc. techniques for features extraction representing datasets. Then, ten different SL and DL models were fine-tuned for HS datasets related to COVID-19. Accuracy, precision, sensitivity, and F-score performance measurement criteria were calculated to compare the performance of the SL and DL algorithms for the problem of HSD. The RNN, one of the models proposed for the first and second dataset in HSD, prevailed with the highest accuracy values of 78.7% and 90.3%, respectively. Due to the promising results of all approaches operated in the HSD, they are forecasted to be chosen in the solution of many other social media and network problems related to COVID-19.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据