4.7 Article

Sparse Geometries Handling in Lattice Boltzmann Method Implementation for Graphic Processors

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2018.2810237

Keywords

GPU; CUDA; LBM; CFD; parallel computing

Funding

  1. National Science Centre [N N501 042140]
  2. Chair of Computer Engineering, Faculty of Electronics, Wroclaw University of Science and Technology

We describe a high-performance implementation of the lattice Boltzmann method (LBM) for sparse geometries on graphic processors. In our implementation we cover the whole geometry with a uniform mesh of small tiles and carry out calculations for each tile independently with proper data synchronization at the tile edges. For this method, we provide both a theoretical analysis of complexity and the results for real implementations involving two-dimensional (2D) and three-dimensional (3D) geometries. Based on the theoretical model, we show that tiles offer significantly smaller bandwidth overheads than solutions based on indirect addressing. For 2D lattice arrangements, a reduction in memory usage is also possible, although at the cost of diminished performance. We achieved a performance of 682 MLUPS on GTX Titan (72 percent of peak theoretical memory bandwidth) for the D3Q19 lattice arrangement and double-precision data.
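The tile covering and the bandwidth figure in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the tile edge of 4 nodes, and the diagonal-channel test geometry are hypothetical. The traffic model assumes each lattice update reads and writes every distribution function exactly once, and the 288.4 GB/s peak bandwidth of the GTX Titan is a vendor figure that does not appear in the abstract.

```python
def occupied_tiles(fluid_nodes, tile_edge):
    """Return the set of tile coordinates that hold at least one fluid node.

    fluid_nodes: iterable of integer coordinate tuples (2D or 3D).
    tile_edge:   edge length of a square/cubic tile in nodes (hypothetical value).
    """
    return {tuple(c // tile_edge for c in node) for node in fluid_nodes}


def fill_ratio(fluid_nodes, tile_edge):
    """Fraction of the nodes stored in occupied tiles that are actually fluid."""
    nodes = set(fluid_nodes)
    dim = len(next(iter(nodes)))
    stored = len(occupied_tiles(nodes, tile_edge)) * tile_edge ** dim
    return len(nodes) / stored


def bandwidth_utilisation(mlups, q, bytes_per_value, peak_gb_per_s):
    """Share of peak memory bandwidth implied by a given MLUPS figure.

    Assumes the minimal traffic model: one read and one write of each of
    the q distribution functions per lattice update.
    """
    bytes_per_update = 2 * q * bytes_per_value
    achieved_gb_per_s = mlups * 1e6 * bytes_per_update / 1e9
    return achieved_gb_per_s / peak_gb_per_s


if __name__ == "__main__":
    # Hypothetical sparse 2D geometry: a two-node-wide diagonal channel.
    channel = {(i, i) for i in range(16)} | {(i, i + 1) for i in range(15)}
    print(len(occupied_tiles(channel, 4)), "tiles,",
          "fill ratio", fill_ratio(channel, 4))

    # D3Q19, double precision, 682 MLUPS, assumed 288.4 GB/s peak bandwidth.
    print("bandwidth utilisation", bandwidth_utilisation(682, 19, 8, 288.4))
```

Under these assumptions, the D3Q19 double-precision case moves 304 bytes per lattice update, so 682 MLUPS corresponds to roughly 207 GB/s, which is consistent with the 72 percent figure quoted above. The fill ratio shows the cost side of tiling: non-fluid nodes inside occupied tiles are still stored.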
