4.6 Article

Sentiment analysis and spam detection in short informal text using learning classifier systems

期刊

SOFT COMPUTING
卷 22, 期 21, 页码 7281-7291

出版社

SPRINGER
DOI: 10.1007/s00500-017-2729-x

关键词

Sentiment analysis; Spam detection; Learning classifier systems; High-dimensional; Sparseness

资金

  1. NSFC [61472022, 61421003, SKLSDE-2016ZX-11]
  2. Beijing Advanced Innovation Center for Big Data and Brain Computing

向作者/读者索取更多资源

Sentiment analysis of public views and spam detection from social media text messages are two challenging data analysis tasks due to short informal text. This paper investigates the performance of learning classifier systems (LCS), which are rule-based machine learning techniques, in sentiment analysis of twitter messages and movie reviews, and spam detection from SMS and email data sets. In this study, an existing LCS technique is extended by introducing a novel encoding scheme to represent classifier rules in order to handle the sparseness in feature vectors, which are generated using the term frequency inverse document frequency of word n-grams and sentiment lexicons. The obtained results show that the proposed encoding scheme smoothed the learning process and generated consistently good results in all experiments conducted in this study.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据