Journal
INTERNATIONAL JOURNAL OF COMPUTATIONAL FLUID DYNAMICS
Volume 31, Issue 9, Pages 396-411Publisher
TAYLOR & FRANCIS LTD
DOI: 10.1080/10618562.2017.1390084
Keywords
Portability; heterogeneous computing; GPU; MPI plus CUDA; openCL; sliced ELLPACK; CFD code
Categories
Funding
- Ministerio de Ciencia e Innovacion, Spain [ENE-2014-60577-R]
- Russian Science Foundation [15-11-30039]
- CONICYT Becas Chile [IJCI-2014-21034]
- Initial Training Network SEDITRANS [607394]
- Russian Science Foundation [15-11-30039] Funding Source: Russian Science Foundation
Ask authors/readers for more resources
Nowadays, high performance computing (HPC) systems experience a disruptive moment with a variety of novel architectures and frameworks, without any clarity of which one is going to prevail. In this context, the portability of codes across different architectures is of major importance. This paper presents a portable implementation model based on an algebraic operational approach for direct numerical simulation (DNS) and large eddy simulation (LES) of incompressible turbulent flows using unstructured hybrid meshes. The strategy proposed consists in representing the whole time-integration algorithm using only three basic algebraic operations: sparse matrix-vector product, a linear combination of vectors and dot product. The main idea is based on decomposing the nonlinear operators into a concatenation of two SpMV operations. This provides high modularity and portability. An exhaustive analysis of the proposed implementation for hybrid CPU/GPU supercomputers has been conducted with tests using up to 128 GPUs. The main objective consists in understanding the challenges of implementing CFD codes on new architectures.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available