4.7 Article

Multi-GPU Efficient Indexing For Maximizing Parallelism of High Dimensional Range Query Services

Journal

IEEE TRANSACTIONS ON SERVICES COMPUTING
Volume 15, Issue 5, Pages 2910-2924

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TSC.2021.3079580

Keywords

Graphics processing units; Parallel processing; Pipelines; Indexing; Servers; Query processing; Periodic structures; GPU; parallel algorithms; indexing methods

Funding

  1. Institute of Information & Communications Technology Planning & Evaluation (IITP) through South Korea Government (MSIT) [2020-0-01389]
  2. Artificial Intelligence Convergence Research Center, Inha University
  3. NSF [1564097, 2038029]
  4. IBM
  5. Directorate for STEM Education
  6. Division Of Graduate Education [2038029] Funding Source: National Science Foundation

Ask authors/readers for more resources

In this article, we propose a novel LBPG-tree indexing method that combines parallelization on a single GPU with a distributed framework, leading to efficient high-dimensional range query service.
Numerous research efforts have been proposed for efficient processing of range queries in high-dimensional space by either redesigning R-tree access structure for exploring massive parallelism on single GPU or exploring distributed framework of R-tree. However, none of the existing efforts explores the integration of the parallelization of the R-tree on a single GPU with a distributed framework for the R-tree. The problem of designing an efficient multi-GPU indexing method, which can effectively combine the parallelism maximization with distributed processing of the R-tree, remains an open challenge. In this article, we present a novel multi-GPU efficient parallel and distributed indexing method, called LBPG-tree. The rationale of the LBPG-tree is to combine the advantages of an instruction pipeline in CPU with the massive parallel processing potential of multiple GPUs by introducing two new optimization strategies: First, we exploit the GPU L2 cache for accelerating both index search and index node access on GPUs. Second, we further improve utilization of L2 cache on GPUs by compacting and sorting candidate nodes called Compact-and-Sort. Our experimental results show that the LBPG-tree outperforms G-tree, the previous representative GPU index method and effectively support multiple GPUs for providing efficient high dimensional range query service.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available