4.3 Article Proceedings Paper

Finding frequent items in data streams

期刊

THEORETICAL COMPUTER SCIENCE
卷 312, 期 1, 页码 3-15

出版社

ELSEVIER
DOI: 10.1016/S0304-3975(03)00400-6

关键词

frequent items; streaming algorithm; approximation

向作者/读者索取更多资源

We present a I-pass algorithm for estimating the most frequent items in a data stream using limited storage space. Our method relies on a data structure called a COUNT SKETCH, which allows us to reliably estimate the frequencies of frequent items in the stream. Our algorithm achieves better space bounds than the previously known best algorithms for this problem for several natural distributions on the item frequencies. In addition, our algorithm leads directly to a 2-pass algorithm for the problem of estimating the items with the largest (absolute) change in frequency between two data streams. To our knowledge, this latter problem has not been previously studied in the literature. (C) 2003 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据