4.7 Article

Distantly Supervised Lifelong Learning for Large-Scale Social Media Sentiment Analysis

Journal

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING
Volume 8, Issue 4, Pages 480-491

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TAFFC.2017.2771234

Keywords

Sentiment analysis; social media analysis; distant supervision; lifelong learning

Funding

  1. Natural Science Foundation of China [61672288]
  2. Natural Science Foundation of Jiangsu Province for Excellent Young Scholars [BK20160085]

Ask authors/readers for more resources

Although sentiment analysis on traditional online texts has been studied in depth, sentiment analysis for social media texts is still a challenging research direction. In the social media that contains a huge amount of texts and a large range of topics, it would be very difficult to manually collect enough labeled data to train a sentiment classifier for different domains. Distant supervision that considers emoticons as natural sentiment labels in the microblog texts has been widely used in social media sentiment analysis. However, the previous distant supervision works were normally trained based on an isolate set of data, and they were not capable to deal with the scenario where the texts are continuously increasing and the topics are constantly changing. To address such challenges, in this work we propose a distantly supervised lifelong learning framework for large-scale social media sentiment analysis. The key characteristic of our approach is continuous sentiment learning in social media. It learns on past tasks sequentially, retains the knowledge obtained from past learning and uses the past knowledge to help future learning. The lifelong sentiment classifier is trained on two large-scale distantly supervised social media datasets respectively, and evaluated on nine benchmark datasets. The results prove that our lifelong sentiment learning approach is feasible and effective to tackle the challenges of continuously updated texts with dynamic topics in social media. We also prove that the belief the more training data the better performance does not hold in large-scale social media sentiment analysis. In contrast, by conducting continuous learning from past tasks, our approach beats the traditional way of using all training data in one task, in terms of both classification performance and computational efficiency.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available