Proceedings Paper

Diversity-Aware Top-k Publish/Subscribe for Text Stream

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.1145/2723372.2749451

Keywords

text stream; diversification; publish/subscribe

Funding

  1. Singapore MOE AcRF Tier 2 Grant [ARC30/12]
  2. Microsoft Research

Massive amounts of text data are being generated by a huge number of web users at an unprecedented scale, and these data cover a wide range of topics. Users are interested in receiving a few up-to-date, representative documents (e.g., tweets) that provide wide coverage of different aspects of their query topics. To address this problem, we consider the Diversity-Aware Top-k Subscription (DAS) query. Given a DAS query, we continuously maintain an up-to-date result set that contains the k most recently returned documents over a text stream for the query. The DAS query takes into account text relevance, document recency, and result diversity. We propose a novel solution for efficiently processing a large number of DAS queries over a stream of documents. We demonstrate the efficiency of our approach on a real-world dataset; the experimental results show that our solution reduces processing time by 60-75% compared with two baselines. We also study the effectiveness of the DAS query.
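
To make the query semantics in the abstract concrete, the following is a minimal sketch of how one subscription's result set might be maintained over a stream. It is an illustration only, not the paper's algorithm: the abstract does not give the scoring function or the update rule, so the Jaccard-based relevance and dissimilarity, the exponential recency decay, the weight `alpha`, the `half_life` parameter, and the "replace the oldest result if the set score improves" rule are all assumptions introduced here, as are the names `Doc`, `set_score`, and `update_result`.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class Doc:
    """Hypothetical stream document: an id, a bag of terms, an arrival time."""
    doc_id: int
    terms: FrozenSet[str]
    timestamp: float


def jaccard(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Jaccard similarity of two term sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def relevance(query: FrozenSet[str], doc: Doc) -> float:
    # Assumption: text relevance is approximated by term overlap.
    return jaccard(query, doc.terms)


def recency(doc: Doc, now: float, half_life: float) -> float:
    # Assumption: recency decays exponentially with document age.
    return 0.5 ** ((now - doc.timestamp) / half_life)


def set_score(query: FrozenSet[str], docs: List[Doc], now: float,
              alpha: float = 0.5, half_life: float = 60.0) -> float:
    """Assumed objective: alpha * mean(relevance * recency)
    + (1 - alpha) * mean pairwise dissimilarity of the result set."""
    if not docs:
        return 0.0
    rel = sum(relevance(query, d) * recency(d, now, half_life)
              for d in docs) / len(docs)
    pairs = [(a, b) for i, a in enumerate(docs) for b in docs[i + 1:]]
    div = (sum(1.0 - jaccard(a.terms, b.terms) for a, b in pairs) / len(pairs)
           if pairs else 0.0)
    return alpha * rel + (1.0 - alpha) * div


def update_result(query: FrozenSet[str], result: List[Doc], new_doc: Doc,
                  k: int, now: float, alpha: float = 0.5,
                  half_life: float = 60.0) -> List[Doc]:
    """Return the result set after seeing new_doc (oldest result first).

    Assumed rule: irrelevant documents are ignored; while fewer than k
    results exist, relevant documents are appended; afterwards the new
    document replaces the oldest result only if the set score improves.
    """
    if relevance(query, new_doc) == 0.0:
        return result
    if len(result) < k:
        return result + [new_doc]
    candidate = result[1:] + [new_doc]
    if (set_score(query, candidate, now, alpha, half_life)
            > set_score(query, result, now, alpha, half_life)):
        return candidate
    return result
```

A caller would keep one `result` list per subscription and call `update_result` for each arriving document. This naive version rescans every subscription for every document, which is the kind of per-query cost an efficient solution for a large number of queries would need to avoid.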
