☆ 4.7 Article

Interpreting the Public Sentiment Variations on Twitter

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2014)

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

Volume 26, Issue 5, Pages 1158-1170

Publisher

IEEE COMPUTER SOC

DOI: 10.1109/TKDE.2013.116

Keywords

Twitter; public sentiment; emerging topic mining; sentiment analysis; latent Dirichlet allocation; Gibbs sampling

Funding

Institute for Collaborative Biotechnologies through U.S. Army Research Office [W911NF-09-0001]
National Basic Research Program of China (973 Program) [2012CB316400]
NSF [0905084, 0917228]
National Natural Science Foundation of China [61125203, 61173186, 61373118]
Direct For Computer & Info Scie & Enginr [0905084] Funding Source: National Science Foundation
Direct For Computer & Info Scie & Enginr
Div Of Information & Intelligent Systems [0917228] Funding Source: National Science Foundation
Div Of Information & Intelligent Systems [0905084] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Millions of users share their opinions on Twitter, making it a valuable platform for tracking and analyzing public sentiment. Such tracking and analysis can provide critical information for decision making in various domains. Therefore it has attracted attention in both academia and industry. Previous research mainly focused on modeling and tracking public sentiment. In this work, we move one step further to interpret sentiment variations. We observed that emerging topics (named foreground topics) within the sentiment variation periods are highly related to the genuine reasons behind the variations. Based on this observation, we propose a Latent Dirichlet Allocation (LDA) based model, Foreground and Background LDA (FB-LDA), to distill foreground topics and filter out longstanding background topics. These foreground topics can give potential interpretations of the sentiment variations. To further enhance the readability of the mined reasons, we select the most representative tweets for foreground topics and develop another generative model called Reason Candidate and Background LDA (RCB-LDA) to rank them with respect to their popularity within the variation period. Experimental results show that our methods can effectively find foreground topics and rank reason candidates. The proposed models can also be applied to other tasks such as finding topic differences between two sets of documents.

Interpreting the Public Sentiment Variations on Twitter

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Interpreting the Public Sentiment Variations on Twitter

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper