4.5 Article

PARALLEL 3D FINITE-DIFFERENCE TIME-DOMAIN METHOD ON MULTI-GPU SYSTEMS

Journal

INTERNATIONAL JOURNAL OF MODERN PHYSICS C
Volume 22, Issue 2, Pages 107-121

Publisher

WORLD SCIENTIFIC PUBL CO PTE LTD
DOI: 10.1142/S012918311101618X

Keywords

Finite-difference time-domain; graphics processing unit; convolutional perfect match layer; compute unified device architecture

Funding

  1. National Foundation of China [60877018]
  2. National Basic Research Program of China (973 Program) [2009CB930503, 2009CB930501, 2007CB613203]

Ask authors/readers for more resources

Finite-difference time-domain (FDTD) is a popular but computational intensive method to solve Maxwell's equations for electrical and optical devices simulation. This paper presents implementations of three-dimensional FDTD with convolutional perfect match layer (CPML) absorbing boundary conditions on graphics processing unit (GPU). Electromagnetic fields in Yee cells are calculated in parallel millions of threads arranged as a grid of blocks with compute unified device architecture (CUDA) programming model and considerable speedup factors are obtained versus sequential CPU code. We extend the parallel algorithm to multiple GPUs in order to solve electrically large structures. Asynchronous memory copy scheme is used in data exchange procedure to improve the computation efficiency. We successfully use this technique to simulate pointwise source radiation and validate the result by comparison to high precision computation, which shows favorable agreements. With four commodity GTX295 graphics cards on a single personal computer, more than 4000 million Yee cells can be updated in one second, which is hundreds of times faster than traditional CPU computation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available