4.4 Article

Are bitvectors optimal?

Journal

SIAM JOURNAL ON COMPUTING
Volume 31, Issue 6, Pages 1723-1744

Publisher

SIAM PUBLICATIONS
DOI: 10.1137/S0097539702405292

Keywords

data structures; set membership problem; bit probe model; set systems

Ask authors/readers for more resources

We study the static membership problem : Given a set S of at most n keys drawn from a universe U of size m, store it so that queries of the form Is u in S? can be answered by making few accesses to the memory. We study schemes for this problem that use space close to the information theoretic lower bound of ( n log( m n)) bits and yet answer queries by reading a small number of bits of the memory. We show that, for epsilon > 0, there is a scheme that stores O (n/epsilon2 log m) bits and answers membership queries using a randomized algorithm that reads just one bit of memory and errs with probability at most. We consider schemes that make no error for queries in S but are allowed to err with probability at most for queries not in S. We show that there exist such schemes that store O ((n/epsilon)(2) log m) bits and answer queries using just one bitprobe. If multiple probes are allowed, then the number of bits stored can be reduced to O(n(1+delta) log m) for any delta > 0. The schemes mentioned above are based on probabilistic constructions of set systems with small intersections. We show lower bounds that come close to our upper bounds (for a large range of n and epsilon) : Schemes that answer queries with just one bitprobe and error probability epsilon must use Omega(n(2)/epsilon(2) log(n/epsilon) log m) bits of storage; if the error is restricted to queries not in S, then the scheme must use Omega(n(2)/epsilon(2) log(n/epsilon) log m) bits of storage. We also consider deterministic schemes for the static membership problem and show tradeoffs between space and the number of probes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available