4.7 Article

HEPart: A balanced hypergraph partitioning algorithm for big data applications

Publisher

ELSEVIER
DOI: 10.1016/j.future.2018.01.009

Keywords

Hypergraph partitioning; Hyperedge partitioning; Load balancing; Big data

Funding

  1. National Natural Science Foundation of China [61632009, 61472451, 61272151, 61402543, 61502163]
  2. High Level Talents Program of Higher Education in Guangdong Province [2016ZJ01]
  3. Natural Science Foundation of Guangdong Province in China [2015A030313638]
  4. Foundation for Distinguished Young Talents in Higher Education of Guangdong in China [2015KQNCX179]
  5. Features Innovation Program of the Department of Education of Guangdong Province [2016KTSCX147]
  6. Hunan Provincial Natural Science Foundation of China [2016JJ3051]

Ask authors/readers for more resources

Minimizing the query cost among multi-hosts is important to data processing for big data applications. Hypergraph is good at modeling data and data relationships of complex networks, the typical big data applications, by representing multi-way relationships or interactions as hyperedges. Hypergraph partitioning (HP) helps to partition the query loads on several hosts, enabling the horizontal scaling of large-scale networks. Existing heuristic HP algorithms are generally vertex hypergraph partitioning, designed to minimize the number of cut hyperedges while satisfying the balance requirements of part weights regarding vertices. However, since workloads are mainly produced by group operations, minimizing query costs landing on hyperedges and balancing the workloads should be the objectives in horizontal scaling. We thus propose a heuristic hyperedge partitioning algorithm, HEPart. Specifically, HEPart directly partitions the hypergraph into K sub-hypergraphs with a minimum cutsize for vertices, while satisfying the balance constraint on hyperedge weights, based on the effective move of hyperedges. The I performance of HEPart is evaluated using several complex network datasets modeled by undirected hypergraphs, under different cutsize metrics. The partitioning quality of HEPart is then compared with alternative hyperedge partitioners and vertex hypergraph partitioning algorithms. The experimental findings demonstrate the utility of HEPart (e.g. low cut cost while keeping load balancing as required, especially over scale-free networks). (C) 2018 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available