4.8 Review

Detecting fake-review buyers using network structure: Direct evidence from Amazon

出版社

NATL ACAD SCIENCES
DOI: 10.1073/pnas.2211932119

关键词

online reviews; networks; machine learning; text analysis

资金

  1. Morrison Center for Marketing Analytics

向作者/读者索取更多资源

Online reviews have a significant impact on consumer decision-making and firm economic outcomes. Fake reviews have become a prevalent issue, and despite academic research and platform efforts, their prevalence continues to rise. This study tackles the issue by collecting a dataset of Amazon product reviews and developing a highly accurate method for detecting fake reviews. By directly observing which sellers buy fake reviews, the researchers successfully identify patterns in the product reviewer network that can predict fake review buyers. The network-based approach proves to be more robust to manipulation compared to text or metadata-based methods.
Online reviews significantly impact consumers' decision-making process and firms' economic outcomes and are widely seen as crucial to the success of online markets. Firms, therefore, have a strong incentive to manipulate ratings using fake reviews. This presents a problem that academic researchers have tried to solve for over two decades and on which platforms expend a large amount of resources. Nevertheless, the prevalence of fake reviews is arguably higher than ever. To combat this, we collect a dataset of reviews for thousands of Amazon products and develop a general and highly accurate method for detecting fake reviews. A unique difference between previous datasets and ours is that we directly observe which sellers buy fake reviews. Thus, while prior research has trained models using laboratory-generated reviews or proxies for fake reviews, we are able to train a model using actual fake reviews. We show that products that buy fake reviews are highly clustered in the product reviewer network. Therefore, features constructed from this network are highly predictive of which products buy fake reviews. We show that our network-based approach is also successful at detecting fake review buyers even without ground truth data, as unsupervised clustering methods can accurately identify fake review buyers by identifying clusters of products that are closely connected in the network. While text or metadata can be manipulated to evade detection, network-based features are more costly to manipulate because these features result directly from the inherent limitations of buying reviews from online review marketplaces, making our detection approach more robust to manipulation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据