4.7 Article

A General Design for a Scalable MPI-GPU Multi-Resolution 2D Numerical Solver

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2019.2961909

关键词

CUDA; multi-GPU; MPI; dynamic load balancing; hilbert space filling curves; multi-resolution grid; shallow water equations (SWE); AMR

资金

  1. CSCS (Switzerland)
  2. CINECA (Italy)
  3. STFC (U.K.)
  4. Italian MIUR, under the Scientific Independence of young researchers program [RBSI14R1GP, D92I15000190001]
  5. Italian INdAM-GNCS Project 2019

向作者/读者索取更多资源

This article presents a multi-GPU implementation of a Finite-Volume solver on a multi-resolution grid. The implementation completely offloads the computation to the GPUs and communications between different GPUs are implemented by means of the Message Passing Interface (MPI) API. Different domain decomposition techniques have been considered and the one based on the Hilbert Space Filling Curves (HSFC) showed optimal scalability. Several optimizations are introduced: One-to-one MPI communications among MPI ranks are completely masked by GPU computations on internal cells and a novel dynamic load balancing algorithm is introduced to minimize the waiting times at global MPI synchronization barriers. Such algorithm adapts the computational load of ranks in response to dynamical changes in the execution time of blocks and in network performances; Its capability to converge to a balanced computation has been empirically shown by numerical experiments. Tests exploit up to 64 GPUs and 83M cells and achieve an efficiency of 90 percent in weak scalability and 85 percent for strong scalability. The framework is general and the results of the article can be ported to a wide range of explicit 2D Partial Differential Equations solvers.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据