☆ 4.4 Article

Semisupervised sentiment analysis method for online text reviews

JOURNAL OF INFORMATION SCIENCE (2021)

Journal

JOURNAL OF INFORMATION SCIENCE

Volume 47, Issue 3, Pages 387-403

Publisher

SAGE PUBLICATIONS LTD

DOI: 10.1177/0165551520910032

Keywords

Adaptive instance-based learning; ensemble learning; lasso regression; semisupervised learning; sentiment analysis; word2vec

Funding

Ministry of Education of the Republic of Korea
National Research Foundation of Korea [NRF-2018S1A3A2075114]
National Research Foundation of Korea [4299990613873] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This article proposes a semi-supervised approach to sentiment analysis that can be trained with only a small amount of labeled data, and through two experiments, it demonstrates that the performance of this method is comparable to that of supervised learning models trained on large datasets.

Sentiment analysis plays an important role in understanding individual opinions expressed in websites such as social media and product review sites. The common approaches to sentiment analysis use the sentiments carried by words that express opinions and are based on either supervised or unsupervised learning techniques. The unsupervised learning approach builds a word-sentiment dictionary, but it requires lengthy time periods and high costs to build a reliable dictionary. The supervised learning approach uses machine learning models to learn the sentiment scores of words; however, training a classifier model requires large amounts of labelled text data to achieve a good performance. In this article, we propose a semisupervised approach that performs well despite having only small amounts of labelled data available for training. The proposed method builds a base sentiment dictionary from a small training dataset using a lasso-based ensemble model with minimal human effort. The scores of words not in the training dataset are estimated using an adaptive instance-based learning model. In a pretrained word2vec model space, the sentiment values of the words in the dictionary are propagated to the words that did not exist in the training dataset. Through two experiments, we demonstrate that the performance of the proposed method is comparable to that of supervised learning models trained on large datasets.

Semisupervised sentiment analysis method for online text reviews

Journal

JOURNAL OF INFORMATION SCIENCE

Publisher

SAGE PUBLICATIONS LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Semisupervised sentiment analysis method for online text reviews

Journal

JOURNAL OF INFORMATION SCIENCE

Publisher

SAGE PUBLICATIONS LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper