4.5 Article

HPC performance study of different collision models using the Lattice Boltzmann solver Musubi

期刊

COMPUTERS & FLUIDS
卷 255, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compfluid.2023.105833

关键词

Lattice Boltzmann method; Hybrid recursive regularized BGK; Cumulant; Turbulence modeling; Musubi

向作者/读者索取更多资源

This study compares the accuracy and performance of several collision schemes on a D3Q27 lattice. The parameterized Cumulant collision scheme shows the best agreement with experimental data in the flow past a circular cylinder test at a Reynolds number of 3900. In terms of performance, all collision schemes achieve a strong scaling parallel efficiency above 60% on up to 16384 cores.
Over the past decades, the lattice Boltzmann method (LBM) has become increasingly popular thanks to its capabilities in the domain of Large-Eddy Simulations (LES). Different collision schemes have been proposed to extend the scope of application to higher Reynolds number flows. This study compares the accuracy and the performance of some of these schemes on a D3Q27 lattice, including the original Multiple Relaxation Times (MRT) model, the Hybrid Recursive Regularized Bhatnagar-Gross-Krook (HRR) operator, as well as the Projected Recursive Regularized Bhatnagar-Gross-Krook (PRR) operator and the parametrized Cumulant collision scheme. For this purpose, the above-mentioned schemes are implemented in the HPC LBM solver Musubi and tested on a well-documented test-case describing the flow past a circular cylinder at a Reynolds number of 3900. Three different subgrid scale (SGS) models are used to account for the unresolved turbulence, i.e. the Smagorinsky model, the Wall-Adapting Local Eddy-viscosity (WALE) model, and the Vreman model. The Cumulant scheme uses an Implicit LES (ILES) subgrid scale model and shows the best agreement with the experimental data followed by MRT with WALE, and HRR with Vreman. The examined collision models are able to capture the second peak at f = 3fvs of the power spectra density of the y velocity component first discovered in experiments. With respect to performance, the collision models are compared in terms of MLUPs/node and parallel efficiency for a strong scaling analysis. Again the Cumulant scheme outperforms the other collision models even when they are run on the reduced D3Q19 stencil. All the collision schemes show a strong scaling parallel efficiency above 60% on up to 16384 cores in our implementation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据