4.6 Article

Monotone submodular subset for sentiment analysis of online reviews

Journal

NEURAL COMPUTING & APPLICATIONS
Volume 33, Issue 19, Pages 12381-12396

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s00521-021-05845-7

Keywords

Subset selection; Sentiment analysis; Sentiment subset selection; Submodular function optimization

Ask authors/readers for more resources

With the prosperity of online social media, there has been a significant increase in user-generated reviews. This paper introduces a sentiment subset selection framework to filter irrelevant sentiment information and select subsets based on topic modeling and submodular maximization with a cardinality constraint. Empirical analysis shows that the proposed framework can compress sentiment corpus while maintaining classifier performance on different metrics.
Along with online social media's prosperity, the amount of user-generated reviews dramatically increases. The kinds of text-based user-generated content are conducive to estimating public sentiments. Many sentiment analysis works are based on the assumption that the sentiment expressed in online reviews can be retrieved from general text features. However, text redundancy and quantity can potentially impact the analysis performance, especially when strict corpus size constraints are applied. This paper proposes a sentiment subset selection framework to construct a small set of documents from the original corpus to convey a subjective representation. The framework can filter irrelevant sentiment information based on topic modeling and select subsets by submodular maximization with respect to a cardinality constraint. Our proposed score function can facilitate the framework to capture fine-grained sentiment features expressed in reviews compared with the conventional submodular-based one. An empirical analysis for the efficacy of the proposed sentiment subset selection framework (SentiSS) on different context domains is conducted. The comparative study of the subset's metric impact on different sentiment levels, namely positive, neural, and negative, is also performed. Experimental results show that the SentiSS framework can compress the sentiment corpus and maintain the classifier's performance on the metrics at the same time.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available