☆ 3.8 Proceedings Paper

LRnLA Lattice Boltzmann Method: A Performance Comparison of Implementations on GPU and CPU

PARALLEL COMPUTATIONAL TECHNOLOGIES, PCT 2019 (2019)

期刊

PARALLEL COMPUTATIONAL TECHNOLOGIES, PCT 2019

卷 1063, 期 -, 页码 139-151

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

DOI: 10.1007/978-3-030-28163-2_10

关键词

LRnLA; LBM; Temporal blocking; Time skewing; GPU; Vectorization

类别

Computer Science, Artificial Intelligence Computer Science, Software Engineering Computer Science, Theory & Methods

资金

Russian Science Foundation [18-71-10004]
Russian Science Foundation [18-71-10004] Funding Source: Russian Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

We present an implementation of the Lattice Boltzmann Method (LBM) with Locally Recursive non-Locally Asynchronous (LRnLA) algorithms on GPU and CPU. The algorithm is based on the recursive subdivision of the domain of the dD1T space-time simulation and loosens the memory-bound limit for numerical schemes with local dependencies. We show that LRnLA algorithm allows to overcome the main memory bandwidth limitations in both CPU and GPU implementations. For CPU, we find the data layout that provides alignment for the full use of AVX2/AVX512 vectorization. For GPU, we devise a procedure for pairwise CUDA-block synchronization applied to the implementation of the LRnLA algorithm, which previously worked only on CPU. The performance on GPU is higher, as it is usual in modern implementations. However, the performance gap in our implementation is smaller, thanks to a more efficient CPU version. Through a detailed comparison, we show possible future applications for both the CPU and the GPU implementations of the lattice Boltzmann method in the complex setting.

LRnLA Lattice Boltzmann Method: A Performance Comparison of Implementations on GPU and CPU

期刊

PARALLEL COMPUTATIONAL TECHNOLOGIES, PCT 2019

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

LRnLA Lattice Boltzmann Method: A Performance Comparison of Implementations on GPU and CPU

期刊

PARALLEL COMPUTATIONAL TECHNOLOGIES, PCT 2019

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文