Article

Implementation of the moving particle semi-implicit method for free-surface flows on GPU clusters

COMPUTER PHYSICS COMMUNICATIONS (2019)

Journal

COMPUTER PHYSICS COMMUNICATIONS

Volume 244, Pages 13-24

Publisher

ELSEVIER

DOI: 10.1016/j.cpc.2019.07.010

Keywords

MPS method; Heterogeneous parallelism; GPU; MPI; CUDA

Categories

Computer Science, Interdisciplinary Applications; Physics, Mathematical

Abstract

The moving particle semi-implicit (MPS) method performs well in simulating incompressible flows with free surfaces. Despite its applicability, the MPS method suffers from a fundamental instability problem and a high computational cost in practical applications. Substantial research has been conducted on improving the stability and accuracy of the MPS method. Moreover, graphics processing units (GPUs), which are multi-processors that execute many three-dimensional geometric processes at high speed, provide unprecedented capability for scientific computations. However, a single GPU card is not sufficient for engineering applications that require several million particles to predict the desired physical processes, because the available memory space is insufficient. In this work, the dynamic stability (DS) algorithm and the particle shifting (PS) algorithm are used to overcome the instability and inaccuracies caused by tensile instability and non-uniform particle distribution, respectively. Based on the stable MPS method, a GPU-based MPS code that uses the compute unified device architecture (CUDA) language has been developed. An efficient neighborhood particle search is performed using an indirect method, and the matrix for the pressure Poisson equation (PPE) is assembled in parallel. Based on the single-GPU version, a multi-GPU MPS code has been developed. The approach uses a non-geometric dynamic domain decomposition method that provides homogeneous load balancing, whereby different portions (subdomains) of the physical system under study are assigned to different GPUs. Communication between devices is achieved with a message passing interface (MPI). Based on the neighborhood particle search, the techniques for building and updating the halo are described in detail. The speed-up of the single-GPU version is analyzed for different numbers of particles, and the scalability of the multi-GPU version is analyzed for different numbers of particles and different numbers of GPUs. Last, an application with more than 10^7 particles is presented to show the capability of the code in handling large-scale simulations. © 2019 Elsevier B.V. All rights reserved.
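
The abstract names three building blocks without giving details: an indirect neighborhood search, a load-balanced domain decomposition, and a halo built from the neighborhood search. The sketch below is one possible reading, not the authors' implementation. It assumes a 2D cell-based search in which particle indices are sorted by cell and looked up through a per-cell range table, a 1D split along x that gives each subdomain roughly the same number of particles, and a halo made of foreign particles within one interaction radius of a subdomain. All function names (`build_cell_table`, `neighbors`, `split_by_count`) are hypothetical, and plain Python stands in for the CUDA and MPI layers.

```python
from math import floor


def build_cell_table(positions, cell_size):
    """Sort particle indices by cell key and record each cell's slice of the
    sorted list. Lookups go through this table instead of scanning all particles."""
    keys = [tuple(floor(c / cell_size) for c in p) for p in positions]
    order = sorted(range(len(positions)), key=lambda i: keys[i])
    cell_range = {}
    for slot, i in enumerate(order):
        start, _ = cell_range.get(keys[i], (slot, slot))
        cell_range[keys[i]] = (start, slot + 1)
    return keys, order, cell_range


def neighbors(i, positions, keys, order, cell_range, radius):
    """Indices j != i with distance < radius (2D only).
    Assumes cell_size >= radius, so the 3x3 block of cells around i suffices."""
    cx, cy = keys[i]
    xi, yi = positions[i]
    found = []
    for dx in (-1, 0, 1):
        for dy in (-1, 0, 1):
            start, end = cell_range.get((cx + dx, cy + dy), (0, 0))
            for j in order[start:end]:
                xj, yj = positions[j]
                if j != i and (xj - xi) ** 2 + (yj - yi) ** 2 < radius ** 2:
                    found.append(j)
    return sorted(found)


def split_by_count(positions, n_parts, radius):
    """Cut along x by particle count (not by fixed coordinates) so every part
    owns about the same number of particles, then list each part's halo:
    particles owned elsewhere whose x lies within `radius` of the part's extent."""
    n = len(positions)
    by_x = sorted(range(n), key=lambda i: positions[i][0])
    bounds = [n * k // n_parts for k in range(n_parts + 1)]
    owned = [by_x[bounds[k]:bounds[k + 1]] for k in range(n_parts)]
    halos = []
    for part in owned:
        if not part:
            halos.append([])
            continue
        lo = positions[part[0]][0] - radius
        hi = positions[part[-1]][0] + radius
        mine = set(part)
        halos.append(sorted(i for i in range(n)
                            if i not in mine and lo <= positions[i][0] <= hi))
    return owned, halos


if __name__ == "__main__":
    pts = [(0.0, 0.0), (0.5, 0.0), (1.2, 0.0), (1.9, 0.0), (2.4, 0.0), (5.0, 0.0)]
    keys, order, cell_range = build_cell_table(pts, cell_size=1.0)
    print(neighbors(2, pts, keys, order, cell_range, radius=1.0))
    print(split_by_count(pts, n_parts=2, radius=1.0))
```

In this sketch, rerunning `split_by_count` after particles move shifts the cuts to follow the particle distribution, which is the sense in which the balancing is dynamic here. In a multi-GPU setting each part would map to one device, and the halo lists would determine what is exchanged between ranks over MPI.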
