4.7 Article

Improved MPC Algorithms for Edit Distance and Ulam Distance

Journal

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
Volume 32, Issue 11, Pages 2764-2776

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2021.3076534

Keywords

MapReduce; parallel algorithms; approximation algorithms; ulam distance; edit distance

Ask authors/readers for more resources

The paper introduces an almost optimal MPC algorithm for Ulam distance and improves the algorithms for edit distance, achieving almost linear running time. Progress has also been made in terms of total memory and reducing the number of machines needed.
Edit distance is one of the most fundamental problems in combinatorial optimization to measure the similarity between strings. Ulam distance is a special case of edit distance where no character is allowed to appear more than once in a string. Recent developments have been very fruitful for obtaining fast and parallel algorithms for both edit distance and Ulam distance. In this work, we present an almost optimal MPC (massively parallel computation) algorithm for Ulam distance and improve MPC algorithms for edit distance. Our algorithm for Ulam distance is almost optimal in the sense that (1) the approximation factor of our algorithm is 1 + epsilon, (2) the round complexity of our algorithm is constant, (3) the total memory of our algorithm is almost linear ((O) over tilde (epsilon)(n)), and (4) the overall running time of our algorithm is almost linear which is the best known for Ulam distance. We also improve the work of Hajiaghayi et al. for edit distance in terms of total memory. The best previously known MPC algorithm for edit distance requires (O) over tilde (n(2x))machines when the memory of each machine is bounded by (O) over tilde (n(1-x)). In this work, we improve the number of machines to (O) over tilde (n((9/5x)) while keeping the memory limit intact. Moreover, the round complexity of our algorithm is constant and the total running time of our algorithm is truly subquadratic. However, our improvement comes at the expense of a constant factor in the approximation guarantee of the algorithm. This improvement is inspired by the recent techniques of Boroujeni et al. and Chakraborty et al. for obtaining truly subquadratic time algorithms for edit distance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available