4.7 Article

Multi-interest semantic changes over time in short-text microblogs

期刊

KNOWLEDGE-BASED SYSTEMS
卷 228, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2021.107249

关键词

User profiling; Text mining; Neural Networks; Information retrieval; Short-text microblogs

向作者/读者索取更多资源

Consumption of content in short-text microblogs is influenced by individual users and their social network interests, which change dynamically over time. Detecting semantic changes is important for mapping user profiles, especially in platforms with limited user data. A model was used to validate interest changes over time, achieving a high Pearson correlation coefficient for interest change verification.
Consumption of content in short-text microblogs is necessitated to a large extent by individual users and their friendship network interests. Based on the dynamism in the data throughput on such platforms, e.g., Twitter, prevailing conditions are bound to determine the nature of consumed or disseminated content. Therefore, semantic interests differ over time even for individual users. Detecting this semantic change over time is integral in mapping user profiles over a time period, especially in microblogs where only the extrinsic user profile identifiers provide metadata that seldom evolve. This is vital in serving relevant third-party content as well as in the computation of topical interest variations over time. In essence, current, and relevant topics of interest to a user on such a platform may not be representative of the same users' interests a few months later. In our quest to identify the most user-representative interests at any given time, each topical term was modelled as the inner product between word embeddings and a time-based embedding representation of assigned topics at varied time periods. The model was fitted onto tweets as time-series documents. To validate the model, changes in the extracted user-representative interests over time were semantically weighed against a mirrored, time-variant dataset. Interest weights across the time-variant datasets were computed and validated in five sub-topics for a period spanning two and a half years. Linearity in the relationships between the test and validation sets could be identified, more so in emerging topics. A Pearson correlation coefficient as high as 0.871 was achieved in interest change verification over the tested period. (C) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据