4.7 Article

POS-RS: A Random Subspace method for sentiment classification based on part-of-speech analysis

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 51, Issue 4, Pages 458-479

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2014.09.004

Keywords

Sentiment classification; Random Subspace; Part of speech; Ensemble learning

Funding

  1. National Natural Science Foundation of China [71071045, 71131002, 71101042, 71471054]
  2. National Basic Research Program of China (973 Program) [2013CB329603]
  3. Specialized Research Fund for the Doctoral Program of Higher Education [20110111120014]
  4. China Postdoctoral Science Foundation [2011M501041, 2013T60611]
  5. Special Fund of Anhui Province Key Research Institute of Humanities and Social Sciences at Universities [SK2013B400]
  6. Special Fund of Political Theory Research Center of Hefei University of Technology [2012HGXJ0392]

With the rise of Web 2.0 platforms, personal opinions such as reviews, ratings, recommendations, and other forms of user-generated content have fueled interest in sentiment classification in both academia and industry. To improve sentiment classification performance, previous research has investigated ensemble methods and shown them to be effective both theoretically and empirically. We advance this line of research by proposing POS-RS, an enhanced Random Subspace method for sentiment classification based on part-of-speech analysis. Unlike existing Random Subspace methods, which use a single subspace rate to control the diversity of base learners, POS-RS employs two parameters, the content lexicon subspace rate and the function lexicon subspace rate, to control the balance between the accuracy and diversity of base learners. Ten publicly available sentiment datasets were used to verify the effectiveness of the proposed method. Empirical results show that POS-RS achieves the best performance by reducing bias and variance simultaneously relative to the base learner, i.e., Support Vector Machine. These results indicate that POS-RS is a viable method for sentiment classification and has the potential to be applied successfully to other text classification problems. (C) 2014 Elsevier Ltd. All rights reserved.
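
The two-rate idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the feature indices have already been split into a content-lexicon group and a function-lexicon group by some part-of-speech tagger, it uses a toy nearest-centroid learner as a stand-in for the Support Vector Machine, and it combines base learners by majority vote, which is also an assumption. All function and parameter names are hypothetical.

```python
import random
from collections import Counter


def sample_pos_subspace(content_idx, function_idx, content_rate, function_rate, rng):
    """Draw one feature subspace using a separate sampling rate per lexicon group."""
    def draw(pool, rate):
        # Keep at least one feature from a non-empty group (an assumption of this sketch).
        k = min(len(pool), max(1, round(rate * len(pool))))
        return rng.sample(pool, k)
    return sorted(draw(content_idx, content_rate) + draw(function_idx, function_rate))


def train_centroids(X, y, features):
    """Toy base learner (stand-in for an SVM): one mean vector per class."""
    counts = Counter(y)
    sums = {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for j, f in enumerate(features):
            acc[j] += row[f]
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def predict_centroids(centroids, row, features):
    """Return the class whose centroid is closest in the sampled subspace."""
    def sq_dist(center):
        return sum((row[f] - center[j]) ** 2 for j, f in enumerate(features))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


def fit_pos_rs(X, y, content_idx, function_idx, content_rate, function_rate,
               n_learners=10, seed=0):
    """Train one base learner per randomly drawn two-rate subspace."""
    rng = random.Random(seed)
    ensemble = []
    for _ in range(n_learners):
        feats = sample_pos_subspace(content_idx, function_idx,
                                    content_rate, function_rate, rng)
        ensemble.append((feats, train_centroids(X, y, feats)))
    return ensemble


def predict_pos_rs(ensemble, row):
    """Combine base-learner predictions by majority vote (assumed combination rule)."""
    votes = Counter(predict_centroids(model, row, feats) for feats, model in ensemble)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Made-up bag-of-words rows: columns 0-3 are content words, 4-5 are function words.
    X = [[1, 1, 0, 0, 1, 0], [1, 1, 0, 0, 1, 0], [0, 0, 1, 1, 0, 1], [0, 0, 1, 1, 0, 1]]
    y = ["pos", "pos", "neg", "neg"]
    ensemble = fit_pos_rs(X, y, [0, 1, 2, 3], [4, 5], content_rate=0.5, function_rate=0.5)
    print(predict_pos_rs(ensemble, [1, 1, 0, 0, 1, 0]))
```

In this sketch, raising either rate gives each base learner more features from that group, while lowering it makes the sampled subspaces differ more from one another. Tuning the two rates independently is the knob the abstract describes for trading accuracy against diversity.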
