A deceptive review detection framework: Combination of coarse and fine-grained features

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 156 (issue and page numbers not listed)

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2020.113465

Keywords

Deceptive reviews detection; LDA topic model; Deep learning; Coarse-grained features; Fine-grained features

Funding

  1. Natural Science Foundation of China [71772107, 61502281]
  2. Qingdao social science planning project [QDSKL1801138]
  3. National Key R&D Plan [2018YFC0831002]
  4. Humanity and Social Science Fund of the Ministry of Education [18YJAZH136]
  5. Key R&D Plan of Shandong Province [2018GGX101045]
  6. Natural Science Foundation of Shandong Province [ZR2018BF013]
  7. Innovative Research Foundation of Qingdao [18-2-2-41-jch]
  8. Shandong Education Quality Improvement Plan for Postgraduate
  9. Leading talent development program of Shandong University of Science and Technology
  10. Special funding for Taishan scholar construction project

Electronic commerce has become a popular shopping mode. To enhance their reputations, attract more customers, and ultimately increase their profits, dishonest sellers often recruit buyers or robots to post large numbers of deceptive reviews that mislead users. Based on how interpretable their learned results are, existing methods for detecting deceptive reviews fall mainly into two groups: those that mine explicit features and neural network-based ones that mine implicit features. At their core, these works perform text classification based on either coarse-grained features (e.g., topic, sentence, and document) or fine-grained features (e.g., word). To take full advantage of existing approaches, this paper proposes a new framework that combines coarse-grained and fine-grained features. In this framework, the coarse-grained implicit semantic features of the topic distribution are learned by concatenating a Latent Dirichlet Allocation (LDA) topic model with a two-layer neural network. In parallel, the fine-grained implicit semantic features are learned from the word-vector representation of the reviews by a deep learning framework. Finally, the features at these two granularities are combined and used to train a Support Vector Machine (SVM) classifier that decides whether a review is deceptive. To verify the effectiveness and performance of this framework, we derive three models by instantiating the fine-grained feature learner with three popular deep learning models, namely TextCNN, long short-term memory (LSTM), and Bidirectional LSTM (BiLSTM). Experimental results on a mixed-domain dataset and on balanced/unbalanced in-domain datasets show that all the combination models are superior to the corresponding baseline models that use a single type of feature. © 2020 Elsevier Ltd. All rights reserved.
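
The final step of the abstract, combining the two kinds of features and training an SVM on the result, can be illustrated with a small toy sketch. Everything concrete here is an assumption made for illustration: the topic distributions and word-level vectors are hand-written numbers standing in for the outputs of the LDA-plus-neural-network branch and the TextCNN/LSTM/BiLSTM branch, the helper names (`combine_features`, `train_linear_svm`, `predict`) are hypothetical, and the classifier is a plain linear SVM trained by subgradient descent on the hinge loss, since the abstract does not state which kernel or solver the authors used.

```python
# Toy sketch (not the authors' implementation): concatenate coarse- and
# fine-grained feature vectors, then fit a linear SVM on the combined vectors.


def combine_features(coarse, fine):
    """Concatenate each review's coarse vector with its fine vector."""
    return [c + f for c, f in zip(coarse, fine)]


def train_linear_svm(X, y, epochs=200, lr=0.1, lam=0.01):
    """Linear SVM via subgradient descent on the L2-regularised hinge loss."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            if yi * score < 1:
                # Margin violated: step along the hinge-loss subgradient.
                w = [wj + lr * (yi * xj - lam * wj) for wj, xj in zip(w, xi)]
                b += lr * yi
            else:
                # Margin satisfied: only the regulariser shrinks the weights.
                w = [wj - lr * lam * wj for wj in w]
    return w, b


def predict(w, b, x):
    """Return +1 (deceptive) or -1 (truthful) for one combined vector."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Hypothetical coarse-grained features: 2-topic distributions per review.
coarse = [[0.9, 0.1], [0.8, 0.2], [0.7, 0.3],
          [0.2, 0.8], [0.1, 0.9], [0.3, 0.7]]
# Hypothetical fine-grained features: 2-d word-level representations.
fine = [[1.0, 0.0], [0.9, 0.2], [0.8, 0.1],
        [0.1, 1.0], [0.0, 0.8], [0.2, 0.9]]
# Labels: +1 = deceptive, -1 = truthful (made-up toy labels).
y = [1, 1, 1, -1, -1, -1]

X = combine_features(coarse, fine)
w, b = train_linear_svm(X, y)
predictions = [predict(w, b, x) for x in X]
accuracy = sum(p == t for p, t in zip(predictions, y)) / len(y)
print("training accuracy:", accuracy)
```

In a realistic pipeline the two hand-written feature lists would be replaced by learned representations, and the accuracy would be measured on held-out reviews rather than on the training set.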
