3.8 Proceedings Paper

CoBaFi - Collaborative Bayesian Filtering

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.1145/2566486.2568040

关键词

Collaborative Filtering; Sampling; Clustering

资金

  1. Google
  2. Amazon
  3. IBM Faculty Award
  4. Google Focused Research Award
  5. National Science Foundation [CNS-1314632, IIS-0953330, IIS-0916345, IIS-0911032]
  6. National Science Foundation Graduate Research Fellowship [DGE-1252522]
  7. Army Research Laboratory [W911NF-09-2-0053]

向作者/读者索取更多资源

Given a large dataset of users' ratings of movies, what is the best model to accurately predict which movies a person will like? And how can we prevent spammers from tricking our algorithms into suggesting a bad movie? Is it possible to infer structure between movies simultaneously? In this paper we describe a unified Bayesian approach to Collaborative Filtering that accomplishes all of these goals. It models the discrete structure of ratings and is flexible to the often non-Gaussian shape of the distribution. Additionally, our method finds a co-clustering of the users and items, which improves the model's accuracy and makes the model robust to fraud. We offer three main contributions: (1) We provide a novel model and Gibbs sampling algorithm that accurately models the quirks of real world ratings, such as convex ratings distributions. (2) We provide proof of our model's robustness to spam and anomalous behavior. (3) We use several real world datasets to demonstrate the model's effectiveness in accurately predicting user's ratings, avoiding prediction skew in the face of injected spam, and finding interesting patterns in real world ratings data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据