An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews

EXPERT SYSTEMS WITH APPLICATIONS (2018)

Journal

EXPERT SYSTEMS WITH APPLICATIONS

Volume 114, Pages 210-223

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.eswa.2018.07.005

Keywords

Deceptive review detection; Topic-sentiment joint probabilistic model; Latent Dirichlet allocation; Gibbs sampling

Funding

Natural Science Foundation of China [71772107, 71403151, 61502281, 61433012]
Key R&D Plan of Shandong Province [2018GGX101045]
Natural Science Foundation of Shandong Province [ZR2018BF013, ZR2013FM023, ZR2014FP011]
Shandong Education Quality Improvement Plan for Postgraduate
China's Post-doctoral Science Fund [2014M561948]
Postdoctoral innovation project special funds of Shandong Province [201403007]
Applied research project for Qingdao postdoctoral researcher
Project of Shandong Province Higher Educational Science and Technology Program [J14LN33]
Leading talent development program of Shandong University of Science and Technology
Special funding for Taishan scholar construction project

Abstract

In electronic commerce, online reviews play an important role in customers' purchasing decisions. Unfortunately, malicious sellers often hire buyers to write fake reviews that improve their reputation. To detect deceptive reviews and mine the topics and sentiments they contain, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on the Latent Dirichlet Allocation (LDA) model. The model first uses a Gibbs sampling algorithm to approximate the parameters of the maximum likelihood function offline and obtain a topic-sentiment joint probability distribution vector for each review. A Random Forest classifier and a Support Vector Machine (SVM) classifier are then each trained offline. Experimental results on real-life datasets show that the proposed model outperforms baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, UTSJ outperforms or performs similarly to benchmark models in detecting deceptive reviews on both balanced and unbalanced datasets in different domains. In particular, UTSJ handles real-life unbalanced big data well, which makes it well suited to e-commerce environments. © 2018 Elsevier Ltd. All rights reserved.
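
The first stage described in the abstract, Gibbs sampling that yields one probability vector per review, can be illustrated with a minimal sketch. This is not the UTSJ model: it is plain LDA with a collapsed Gibbs sampler, and the sentiment layer is omitted because the abstract does not specify it. The function name `lda_gibbs`, the hyperparameter values, and the toy reviews are all assumptions made for illustration.

```python
import random


def lda_gibbs(docs, n_topics, alpha=0.1, beta=0.01, n_iter=200, seed=0):
    """Collapsed Gibbs sampling for plain LDA (illustrative; no sentiment layer).

    docs is a list of token lists. Returns one topic-probability vector
    per document.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    # Count tables maintained by the collapsed sampler.
    doc_topic = [[0] * n_topics for _ in docs]            # tokens in doc d with topic k
    topic_word = [[0] * n_words for _ in range(n_topics)]  # times word v has topic k
    topic_total = [0] * n_topics                           # tokens assigned to topic k

    # Random initial topic assignment for every token.
    assignments = []
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Full conditional over topics, up to a normalizing constant.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed per-document topic distribution, usable as a feature vector.
    features = []
    for d, doc in enumerate(docs):
        denom = len(doc) + n_topics * alpha
        features.append(
            [(doc_topic[d][t] + alpha) / denom for t in range(n_topics)]
        )
    return features


# Hypothetical toy reviews, already tokenized.
reviews = [
    "great room clean staff friendly".split(),
    "room dirty staff rude never again".split(),
    "amazing hotel best stay ever amazing".split(),
]
vectors = lda_gibbs(reviews, n_topics=2)
```

Each entry of `vectors` is a probability distribution over topics for one review. In the pipeline the abstract outlines, vectors of this kind (with sentiment included) would serve as the input features for the Random Forest and SVM classifiers.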
