4.7 Article

High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster

Journal

JOURNAL OF COMPUTATIONAL PHYSICS
Volume 229, Issue 20, Pages 7692-7714

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jcp.2010.06.024

Keywords

GPU computing; Finite elements; Spectral elements; Seismic modeling; CUDA; MPI; Speedup; Cluster

Funding

  1. French ANR [ANR-05-CIGC-002]
  2. French CNRS
  3. INRIA
  4. IUF
  5. German Deutsche Forschungsgemeinschaft [TU102/22-1, TU102/22-2]
  6. German BMBF [011H08003D]

Ask authors/readers for more resources

We implement a high-order finite-element application, which performs the numerical simulation of seismic wave propagation resulting for instance from earthquakes at the scale of a continent or from active seismic acquisition experiments in the oil industry, on a large cluster of NVIDIA Tesla graphics cards using the CUDA programming environment and non-blocking message passing based on MPI. Contrary to many finite-element implementations, ours is implemented successfully in single precision, maximizing the performance of current generation GPUs. We discuss the implementation and optimization of the code and compare it to an existing very optimized implementation in C language and MPI on a classical cluster of CPU nodes. We use mesh coloring to efficiently handle summation operations over degrees of freedom on an unstructured mesh, and non-blocking MPI messages in order to overlap the communications across the network and the data transfer to and from the device via PCIe with calculations on the GPU. We perform a number of numerical tests to validate the single-precision CUDA and MPI implementation and assess its accuracy. We then analyze performance measurements and depending on how the problem is mapped to the reference CPU cluster, we obtain a speedup of 20x or 12x. (C) 2010 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available