☆ 3.8 Proceedings Paper

Deep Headline Generation for Clickbait Detection

2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) (2018)

期刊

2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM)

卷 -, 期 -, 页码 467-476

出版社

IEEE

DOI: 10.1109/ICDM.2018.00062

关键词

Data augmentation; deep generative model; clickbait detection

类别

Computer Science, Artificial Intelligence Computer Science, Information Systems

资金

National Science Foundation (NSF) [1614576]
Office of Naval Research (ONR) [N00014-17-1-2605]
NSF [1422215, 1663343, 1742702, 1820609]
Direct For Computer & Info Scie & Enginr
Division Of Computer and Network Systems [1422215] Funding Source: National Science Foundation
Direct For Education and Human Resources
Division Of Graduate Education [1663343] Funding Source: National Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Clickbaits are catchy social posts or sensational headlines that attempt to lure readers to click. Clickbaits are pervasive on social media and can have significant negative impacts on both users and media ecosystem. For example, users may be misled to receive inaccurate information, or fall into click-jacking attacks. Similarly, media platforms could lose readers' trust and revenues due to the prevalence of clickbaits. To computationally detect such clickbaits on social media using supervised learning framework, one of the major obstacles is the lack of large-scale labeled training data, due to laborious and costly labeling. With the recent advancements in deep generative models, to address this challenge, we propose to generate synthetic headlines with specific styles and explore their utilities to help improve clickbait detection. In particular, we propose to generate stylized headlines from original documents with style transfer. Furthermore, as it is non-trivial to generate stylized headlines due to several challenges such as the discrete nature of texts and the requirements of preserving semantic meaning of the document while achieving style transfer, we propose a novel solution, named as Stylized Headline Generation (SHG), that can not only generate readable and realistic headlines to enlarge original training data, but also helps improve the classification capacity of supervised learning. The experimental results on real-world datasets demonstrate the effectiveness of SHG on generating high-quality and high-utility headlines for clickbait detection.

Deep Headline Generation for Clickbait Detection

期刊

2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM)

出版社

IEEE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Deep Headline Generation for Clickbait Detection

期刊

2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM)

出版社

IEEE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文