4.7 Article

A non-convex semi-supervised approach to opinion spam detection by ramp-one class SVM

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 57, Issue 6, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2020.102381

Keywords

Ramp-one class svm; Opinion spam; Deceptive opinion; Outlier detection

Ask authors/readers for more resources

Nowadays, e-commerce has become a part of our daily life in such a way that people's decision for buying products or choosing services highly depends on comments, reviews, and rates, which are posted on related businesses' website and other social media. Because of the importance and prevalence of these sources of information, fraudsters are tempted to use fraudulently opinion sharing platforms in order to promote or to discredit some target products or services. Although a wide range of approaches have been proposed to address this problem and to help distinguish between the deceptive or fraudulent opinions from the trustful ones, this is still a challenging problem. Lack of well-defined deceptive data samples or insufficiency of spam review instances in the training sets cause supervised techniques facing an imbalanced classes problem. Furthermore, even in the golden normal opinion datasets, there is a possibility of the presence of some abnormal records or outliers. To deal with these two issues, we propose a robust and nonconvex semi-supervised algorithm called Ramp One-Class SVM. In the proposed method, oneclass SVM is adopted to handle the lack of labeled data for the deceptive opinions and by taking the advantages of non-convex properties of the Ramp loss function, we eliminate the effects of outliers and non-review opinions. The performance of the proposed method is evaluated by an artificial dataset and two real datasets including Ott and Yelp crowdsourced datasets. The results show the superiority of our method by achieving an accuracy of 92.13% and 74.37% for Ott and Yelp crowdsourced datasets, respectively. The obtained results also reveal the effectiveness of the proposed model in terms of precision, recall, generalization power, and robustness to the outliers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available