4.6 Review

A review on abusive content automatic detection: approaches, challenges and opportunities

Related references

Note: Only part of the references are listed.
Article Computer Science, Artificial Intelligence

To Augment or Not to Augment? A Comparative Study on Text Augmentation Techniques for Low-Resource NLP

Gozde Gul Sahin

Summary: This article explores the impact of text augmentation methods on language sequence tagging tasks and conducts a systematic performance analysis on multiple languages and tasks. The study finds that augmentation methods have the most significant improvement on dependency parsing and perform better on morphologically rich languages. Character-level methods consistently perform well, while improvements from synonym replacement and syntactic augmentation are inconsistent.

COMPUTATIONAL LINGUISTICS (2022)

Article Computer Science, Information Systems

Detection of hate speech in Arabic tweets using deep learning

Areej Al-Hassan et al.

Summary: Nowadays, social networks have become a common platform for communication, but there is also an issue of hate speech propagation. Research using deep learning models to analyze Arabic tweets found that these models perform better in detecting hateful content.

MULTIMEDIA SYSTEMS (2022)

Proceedings Paper Computer Science, Artificial Intelligence

HaT5: Hate Language Identification using Text-to-Text Transfer Transformer

Sana Sabah Sabry et al.

Summary: This paper investigates the performance of a state-of-the-art architecture called T5 and compares it with three previous state-of-the-art architectures across five different tasks from two diverse datasets. By using a new autoregressive conversational AI model checkpoint for training data augmentation, the authors achieve near-state-of-the-art results on some tasks and explain the predictions made by one of the models using the Integrated Gradient algorithm. The main contributions of this work are the implementation method of T5, the data augmentation technique, and the identification of shortcomings in the dataset.

2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) (2022)

Article Computer Science, Information Systems

CyberBERT: BERT for cyberbullying identification BERT for cyberbullying identification

Sayanta Paul et al.

Summary: This paper presents a novel application of BERT for cyberbullying identification and demonstrates its effectiveness through experiments, achieving significant improvements over previous deep neural network models.

MULTIMEDIA SYSTEMS (2022)

Review Computer Science, Interdisciplinary Applications

Resources and benchmark corpora for hate speech detection: a systematic review

Fabio Poletto et al.

Summary: Hate speech detection in social media has recently gained significant traction in the Natural Language Processing community, with annotated corpora and benchmarks being key resources. Lexica also play an important role in the development of hate speech detection systems. The analysis highlights a heterogeneous, growing landscape with several issues and venues for improvement.

LANGUAGE RESOURCES AND EVALUATION (2021)

Article Computer Science, Artificial Intelligence

A Survey of Offensive Language Detection for the Arabic Language

Fatemah Husain et al.

Summary: This survey reviews previous studies on Arabic offensive language detection, highlighting the importance of continued efforts and the development of new preprocessing and feature extraction techniques.

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING (2021)

Article Computer Science, Information Systems

How well do hate speech, toxicity, abusive and offensive language classification models generalize across datasets?

Paula Fortuna et al.

Summary: Research indicates that the generalization of models varies from model to model, with certain categories serving better as cross-dataset training categories. Additionally, for successful generalization, a model needs to perform well in an intra-dataset scenario, with other parameters such as training and target categories also playing a crucial role.

INFORMATION PROCESSING & MANAGEMENT (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Abusive Content Detection in Online User-Generated Data: A survey

Simrat Kaur et al.

Summary: The rise of social media platforms has led to a significant increase in user-generated content, empowering users to create, share and exchange information, but also increasing the possibility of negative content spreading by cyber-bullies and haters. The research community has proposed many automated approaches utilizing NLP, machine learning and deep learning techniques to identify offensive content, while facing numerous challenges.

AI IN COMPUTATIONAL LINGUISTICS (2021)

Article Computer Science, Information Systems

Detecting White Supremacist Hate Speech Using Domain Specific Word Embedding With Deep Learning and BERT

Hind S. Alatawi et al.

Summary: White supremacist hate speech has become a harmful content on social media, with detrimental impacts on society, highlighting the need for automatic detection of such speech. This research explores the feasibility of detecting white supremacist hate speech on Twitter using deep learning and natural language processing techniques, utilizing BiLSTM and BERT models for this purpose. The BiLSTM model achieved a 0.75 F1-score, while BERT reached a higher 0.80 F1-score, both tested on a balanced dataset from Twitter and a Stormfront dataset.

IEEE ACCESS (2021)

Article Multidisciplinary Sciences

A Deep Learning Framework for Automatic Detection of Hate Speech Embedded in Arabic Tweets

Rehab Duwairi et al.

Summary: This study compared the performance of different deep learning networks in automatically detecting hateful content on social media, across the ArHS dataset and other Arabic hate speech datasets, showing varied accuracies among CNN, BiLSTM-CNN, and CNN-LSTM models in different classification tasks.

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING (2021)

Article Computer Science, Information Systems

Developing an online hate classifier for multiple social media platforms

Joni Salminen et al.

HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES (2020)

Article Chemistry, Multidisciplinary

A Deep Learning Approach for Automatic Hate Speech Detection in the Saudi Twittersphere

Raghad Alshalan et al.

APPLIED SCIENCES-BASEL (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Classification of Cyberbullying Text in Arabic

Benaissa Azzeddine Rachid et al.

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) (2020)

Article Computer Science, Information Systems

Misogyny Detection in Twitter: a Multilingual and Cross-Domain Study

Endang Wahyu Pamungkas et al.

INFORMATION PROCESSING & MANAGEMENT (2020)

Article Computer Science, Information Systems

Deep Learning Based Fusion Approach for Hate Speech Detection

Yanling Zhou et al.

IEEE ACCESS (2020)

Article Information Science & Library Science

Sentiment analysis of extremism in social media from textual information

Muhammad Asif et al.

TELEMATICS AND INFORMATICS (2020)

Article Computer Science, Artificial Intelligence

Hate speech detection: A solved problem? The challenging case of long tail on Twitter

Ziqi Zhang et al.

SEMANTIC WEB (2019)

Proceedings Paper Computer Science, Theory & Methods

Augment to Prevent: Short-Text Data Augmentation in Deep Learning for Hate-Speech Classification

Georgios Rizos et al.

PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19) (2019)

Article Computer Science, Artificial Intelligence

Challenges in discriminating profanity from hate speech

Shervin Malmasi et al.

JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE (2018)

Article Computer Science, Information Systems

Towards enhancement of a lexicon-based approach for Saudi dialect sentiment analysis

Adel Assiri et al.

JOURNAL OF INFORMATION SCIENCE (2018)

Article Computer Science, Theory & Methods

A Survey on Automatic Detection of Hate Speech in Text

Paula Fortuna et al.

ACM COMPUTING SURVEYS (2018)

Article Computer Science, Artificial Intelligence

Effective hate-speech detection in Twitter data using recurrent neural networks

Georgios K. Pitsilis et al.

APPLIED INTELLIGENCE (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Dataset Construction for the Detection of Anti-Social Behaviour in Online Communication in Arabic

Azalden Alakrot et al.

ARABIC COMPUTATIONAL LINGUISTICS (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Towards Accurate Detection of Offensive Language in Online Communication in Arabic

Azalden Alakrot et al.

ARABIC COMPUTATIONAL LINGUISTICS (2018)

Proceedings Paper Computer Science, Information Systems

Mean Birds: Detecting Aggression and Bullying on Twitter

Despoina Chatzakou et al.

PROCEEDINGS OF THE 2017 ACM WEB SCIENCE CONFERENCE (WEBSCI '17) (2017)

Proceedings Paper Computer Science, Information Systems

Deep Learning for Hate Speech Detection in Tweets

Pinkesh Badjatiya et al.

WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (2017)

Article Mathematics, Interdisciplinary Applications

Us and them: identifying cyber hate on Twitter across multiple protected characteristics

Pete Burnap et al.

EPJ DATA SCIENCE (2016)

Article Psychology, Multidisciplinary

Cybercrime detection in online communications: The experimental case of cyberbullying detection in the Twitter network

Mohammed Ali Al-garadr et al.

COMPUTERS IN HUMAN BEHAVIOR (2016)

Proceedings Paper Computer Science, Theory & Methods

Hate Speech Detection with Comment Embeddings

Nemanja Djuric et al.

WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (2015)

Article Computer Science, Artificial Intelligence

Extracting relevant knowledge for the detection of sarcasm and nastiness in the social web

Raquel Justo et al.

KNOWLEDGE-BASED SYSTEMS (2014)

Proceedings Paper Computer Science, Theory & Methods

Detecting Offensive Language in Social Media to Protect Adolescent Online Safety

Ying Chen et al.

PROCEEDINGS OF 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY, RISK AND TRUST AND 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM/PASSAT 2012) (2012)

Review Medicine, General & Internal

Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement

David Moher et al.

PLOS MEDICINE (2009)

Review Computer Science, Artificial Intelligence

Inter-Coder Agreement for Computational Linguistics

Ron Artstein et al.

COMPUTATIONAL LINGUISTICS (2008)