4.4 Article

Dynamic Scaling for Parallel Graph Computations

Journal

PROCEEDINGS OF THE VLDB ENDOWMENT
Volume 12, Issue 8, Pages 877-890

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.14778/3324301.3324305

Keywords

-

Funding

  1. ERC [652976]
  2. Royal Society Wolfson Research Merit Award [WRM/R1/180014]
  3. EPSRC [EP/M025268/1]
  4. 973 Project [2014CB340302]
  5. NSFC [61421003, 61602023]
  6. Shenzhen Institute of Computing Sciences
  7. Beijing Advanced Innovation Center for Big Data and Brain Computing
  8. Engineering and Physical Sciences Research Council, ESPRC Centre for Doctoral Training in Pervasive Parallelism at the University of Edinburgh, School of Informatics [EP/L01503X/1]
  9. EPSRC [EP/M507258/1] Funding Source: UKRI

Ask authors/readers for more resources

This paper studies scaling out/in to cope with load surges. Given a graph G that is vertex-partitioned and distributed across n processors, it is to add (resp. remove) k processors and re-distribute G across n + k (resp. n - k) processors such that the load among the processors is balanced, and its replication factor and migration cost are minimized. We show that this tri-criteria optimization problem is intractable, even when k is a constant and when either load balancing or minimum migration is not required. Nonetheless, we propose two parallel solutions to dynamic scaling. One consists of approximation algorithms by extending consistent hashing. Given a load balancing factor above a lower bound, the algorithms guarantee provable bounds on both replication factor and migration cost. The other is a generic scaling scheme. Given any existing vertex-partitioner VP of users' choice, it adaptively scales VP in and out such that it incurs minimum migration cost, and ensures balance and replication factors within a bound relative to that of VP. Using real-life and synthetic graphs, we experimentally verify the efficiency, effectiveness and scalability of the solutions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available