4.4 Article

Sliding-window top-k queries on uncertain streams

Journal

VLDB JOURNAL
Volume 19, Issue 3, Pages 411-435

Publisher

SPRINGER
DOI: 10.1007/s00778-009-0171-0

Keywords

Uncertain stream; Top-k query; Sliding-window

Funding

  1. National Natural Science Foundation of China [60803020, 60933001, 60925008]
  2. Hong Kong Direct Allocation Grant [DAG07/08]
  3. RGC HKUST [6119/07E]
  4. National Basic Research Program of China (973 Program) [2006CB303000]
  5. RGC HKSAR, China [418206]
  6. ARC [DP0666428, DP0881035]

Ask authors/readers for more resources

Recently, due to the imprecise nature of the data generated from a variety of streaming applications, such as sensor networks, query processing on uncertain data streams has become an important problem. However, all the existing works on uncertain data streams study unbounded streams. In this paper, we take the first step towards the important and challenging problem of answering sliding-window queries on uncertain data streams, with a focus on one of the most important types of queries-top-k queries. It is nontrivial to find an efficient solution for answering sliding-window top-k queries on uncertain data streams, because challenges not only stem from the strict space and time requirements of processing both arriving and expiring tuples in high-speed streams, but also rise from the exponential blowup in the number of possible worlds induced by the uncertain data model. In this paper, we design a unified framework for processing sliding-window top-k queries on uncertain streams. We show that all the existing top-k definitions in the literature can be plugged into our framework, resulting in several succinct synopses that use space much smaller than the window size, while they are also highly efficient in terms of processing time. We also extend our framework to answering multiple top-k queries. In addition to the theoretical space and time bounds that we prove for these synopses, we present a thorough experimental report to verify their practical efficiency on both synthetic and real data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available