☆ 4.1 Article

Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection

FUTURE INTERNET (2019)

期刊

FUTURE INTERNET

卷 11, 期 7, 页码 -

出版社

MDPI

DOI: 10.3390/fi11070155

关键词

marketing intention; feature extraction; ensemble learning

类别

Computer Science, Information Systems

资金

National Natural Science Foundation of China [61772231]
Shandong Provincial Natural Science Foundation [ZR2017MF025]
Shandong Provincial Key R&D Program of China [2018CXGC0706]
Science and Technology Program of University of Jinan [XKY1734, XKY1828]
Project of Shandong Provincial Social Science Program [18CHLJ39]
Guidance Ability Improving Program of Postgraduate Tutor in University of Jinan [YJZ1801]
Teaching Research Project of University of Jinan [JZ1807]
Project of Independent Cultivated Innovation Team of Jinan City [2018GXRC002]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Social network services for self-media, such as Weibo, Blog, and WeChat Public, constitute a powerful medium that allows users to publish posts every day. Due to insufficient information transparency, malicious marketing of the Internet from self-media posts imposes potential harm on society. Therefore, it is necessary to identify news with marketing intentions for life. We follow the idea of text classification to identify marketing intentions. Although there are some current methods to address intention detection, the challenge is how the feature extraction of text reflects semantic information and how to improve the time complexity and space complexity of the recognition model. To this end, this paper proposes a machine learning method to identify marketing intentions from large-scale We-Media data. First, the proposed Latent Semantic Analysis (LSI)-Word2vec model can reflect the semantic features. Second, the decision tree model is simplified by decision tree pruning to save computing resources and reduce the time complexity. Finally, this paper examines the effects of classifier associations and uses the optimal configuration to help people efficiently identify marketing intention. Finally, the detailed experimental evaluation on several metrics shows that our approaches are effective and efficient. The F1 value can be increased by about 5%, and the running time is increased by 20%, which prove that the newly-proposed method can effectively improve the accuracy of marketing news recognition.

Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection

期刊

FUTURE INTERNET

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Stacking-Based Ensemble Learning of Self-Media Data for Marketing Intention Detection

期刊

FUTURE INTERNET

出版社

MDPI

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文