4.5 Article

An Automated System to Predict Popular Cybersecurity News Using Document Embeddings

Journal

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES
Volume 127, Issue 2, Pages 533-547

Publisher

TECH SCIENCE PRESS
DOI: 10.32604/cmes.2021.014355

Keywords

Embeddings; semantics; cosine similarity; popularity; word2vec

Funding

  1. Korea Institute for Advancement of Technology (KIAT) - Korea Government (MOTIE) [P0012724]
  2. Soonchunhyang University Research Fund

Ask authors/readers for more resources

This study proposes a new approach to estimate the popularity of news articles by adding semantics in the article similarity based approach of popularity estimation. It uses a semantically enriched model to estimate news popularity by measuring cosine similarity between document embeddings of the news articles and Word2vec model to generate distributed representations of the news content. Experimental results show that this approach outperforms other models in predicting news popularity.
The substantial competition among the news industries puts editors under the pressure of posting news articles which are likely to gain more user attention. Anticipating the popularity of news articles can help the editorial teams in making decisions about posting a news article. Article similarity extracted from the articles posted within a small period of time is found to be a useful feature in existing popularity prediction approaches. This work proposes a new approach to estimate the popularity of news articles by adding semantics in the article similarity based approach of popularity estimation. A semantically enriched model is proposed which estimates news popularity by measuring cosine similarity between document embeddings of the news articles. Word2vec model has been used to generate distributed representations of the news content. In this work, we define popularity as the number of times a news article is posted on different websites. We collect data from different websites that post news concerning the domain of cybersecurity and estimate the popularity of cybersecurity news. The proposed approach is compared with different models and it is shown that it outperforms the other models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available