4.5 Article

Scalable multi-relaxation-time lattice Boltzmann simulations on multi-GPU cluster

期刊

COMPUTERS & FLUIDS
卷 110, 期 -, 页码 1-8

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compfluid.2014.12.010

关键词

Multi relaxation time (MRT); Lattice Boltzmann model (LBM); Graphic processing unit; Message Passing Interface; Three dimensional lid-driven cavity flow

资金

  1. Ministry of Science and Technology of Taiwan [NSC 102-2221-E-007-060-MY3]
  2. Low carbon energy research center of National Tsing Hua University - Taiwan

向作者/读者索取更多资源

In this paper, the D3Q19 multi-relaxation-time lattice Boltzmann model is adopted to simulate three-dimensional cavity flows using graphic processing units (GPUs). For single CPU computations, utilizing on-chip memory generates three to five times speedup over adopting global memory alone. Also, streaming using offset reading attains another two times speedup over employing offset writing, For Message Passing Interface (MPI) based multi-CPU computations, overlapping communication and computation can achieve 38% improvement and provide an efficient scheme to improve the scalability and its performance. Numerical experiments show that 12 Tesla (TM) M2070 CPUs produce around 5500 million lattices updates per second (MLUPS) using 576(3) grid. On the other hand, three GTX Titans deliver 5000 MLUPS for 192(3) grids, while 12 Tesla attain half performance. (C) 2014 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据