3.8 Proceedings Paper

Deep Headline Generation for Clickbait Detection

出版社

IEEE
DOI: 10.1109/ICDM.2018.00062

关键词

Data augmentation; deep generative model; clickbait detection

资金

  1. National Science Foundation (NSF) [1614576]
  2. Office of Naval Research (ONR) [N00014-17-1-2605]
  3. NSF [1422215, 1663343, 1742702, 1820609]
  4. Direct For Computer & Info Scie & Enginr
  5. Division Of Computer and Network Systems [1422215] Funding Source: National Science Foundation
  6. Direct For Education and Human Resources
  7. Division Of Graduate Education [1663343] Funding Source: National Science Foundation

向作者/读者索取更多资源

Clickbaits are catchy social posts or sensational headlines that attempt to lure readers to click. Clickbaits are pervasive on social media and can have significant negative impacts on both users and media ecosystem. For example, users may be misled to receive inaccurate information, or fall into click-jacking attacks. Similarly, media platforms could lose readers' trust and revenues due to the prevalence of clickbaits. To computationally detect such clickbaits on social media using supervised learning framework, one of the major obstacles is the lack of large-scale labeled training data, due to laborious and costly labeling. With the recent advancements in deep generative models, to address this challenge, we propose to generate synthetic headlines with specific styles and explore their utilities to help improve clickbait detection. In particular, we propose to generate stylized headlines from original documents with style transfer. Furthermore, as it is non-trivial to generate stylized headlines due to several challenges such as the discrete nature of texts and the requirements of preserving semantic meaning of the document while achieving style transfer, we propose a novel solution, named as Stylized Headline Generation (SHG), that can not only generate readable and realistic headlines to enlarge original training data, but also helps improve the classification capacity of supervised learning. The experimental results on real-world datasets demonstrate the effectiveness of SHG on generating high-quality and high-utility headlines for clickbait detection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据