4.5 Article

Efficient Multiset Synchronization

期刊

IEEE-ACM TRANSACTIONS ON NETWORKING
卷 25, 期 2, 页码 1190-1205

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNET.2016.2618006

关键词

Multiset synchronization; counting bloom filter; invertible bloom filter; invertible counting bloom filter

资金

  1. National Natural Science Foundation for Outstanding Excellent Young Scholars of China [61422214]
  2. National Basic Research Program (973 program) [2014CB347800]
  3. National Natural Science Foundation of China [61661015]
  4. Program for New Century Excellent Talents in University
  5. Hunan Provincial Natural Science Fund for Distinguished Young Scholars [2016JJ1002]
  6. NUDT [JQ14-05-02, ZDYYJCYJ20140601]

向作者/读者索取更多资源

Set synchronization is an essential job for distributed applications. In many cases, given two sets A and B, applications need to identify those elements that appear in set A but not in set B, and vice versa. Bloom filter, a space-efficient data structure for representing a set and supporting membership queries, has been employed as a lightweight method to realize set synchronization with a low false positive probability. Unfortunately, bloom filters and their variants can only be applied to simple sets rather than more general multisets, which allow elements to appear multiple times. In this paper, we first examine the potential of addressing the multiset synchronization problem based on two existing variants of the bloom filters: the IBF and the counting bloom filter (CBF). We then design a novel data structure, invertible CBF (ICBF), which represents a multiset using a vector of cells. Each cell contains two fields, id and count, which record the identifiers and number of elements mapped into them, respectively. Given two multisets, based on the encoding results, the ICBF can execute the dedicated subtracting and decoding operations to recognize the different elements and differences in the multiplicities of elements between the two multisets. We conduct comprehensive experiments to evaluate and compare the three dedicated multiset synchronization approaches proposed in this paper. The evaluation results indicate that the ICBF-based approach outperforms the other two approaches in terms of synchronization accuracy, time-consumption, and communication overhead.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据