4.6 Article

Improving content popularity prediction with k-means clustering and deep-belief networks

Journal

MULTIMEDIA TOOLS AND APPLICATIONS
Volume 80, Issue 10, Pages 15745-15764

Publisher

SPRINGER
DOI: 10.1007/s11042-020-10463-x

Keywords

Content popularity prediction; Deep-belief network (DBN); K-means clustering with Pearson distance; User generated content (UGC)

Ask authors/readers for more resources

This paper introduces an improved method for predicting content popularity in the early stage, using k-means clustering and DBN models, which significantly improves performance and outperforms other methods in reducing errors.
User-Generated Content (UGC) is turning into the predominant type of internet traffic. Content popularity prediction plays a pivotal role in managing this large-scale traffic. As a result, popularity prediction is increasingly becoming an important area of research in computer networking. Generally, popularity prediction methods are classified into two groups, namely, feature-driven and early-stage. While feature-driven methods predict content popularity before publication, early-stage methods monitor early content popularities to forecast the future. Many papers have shown that early-stage popularity prediction performs better than feature-driven methods. In this paper, we improve the performance of early-stage popularity prediction by first, classifying the data into several clusters using k-means clustering with Pearson correlation distance, and then, training a Deep-Belief Network (DBN) for each cluster. We evaluate our method using a dataset of YouTube videos and show that using a generative model such as DBN for time series prediction significantly improves the performance. Numerical results indicate that our proposed method outperforms other state-of-the-art methods by reducing Mean Absolute Percentage Error (MAPE) and mean Relative Square Error (mRSE) by up to 47.86% and 25.18%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available