4.7 Article

Accelerating nbody6 with graphics processing units

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
Volume 424, Issue 1, Pages 545-552

Publisher

WILEY-BLACKWELL
DOI: 10.1111/j.1365-2966.2012.21227.x

Keywords

methods: numerical; globular clusters: general

Funding

  1. STFC [ST/H004912/1] Funding Source: UKRI
  2. Science and Technology Facilities Council [ST/H004912/1] Funding Source: researchfish

Ask authors/readers for more resources

We describe the use of graphics processing units (GPUs) for speeding up the code nbody6 which is widely used for direct N-body simulations. Over the years, the N2 nature of the direct force calculation has proved a barrier for extending the particle number. Following an early introduction of force polynomials and individual time steps, the calculation cost was first reduced by the introduction of a neighbour scheme. After a decade of GRAPE computers which speeded up the force calculation further, we are now in the era of GPUs where relatively small hardware systems are highly cost effective. A significant gain in efficiency is achieved by employing the GPU to obtain the so-called regular force which typically involves some 99 per cent of the particles, while the remaining local forces are evaluated on the host. However, the latter operation is performed up to 20 times more frequently and may still account for a significant cost. This effort is reduced by parallel SSE/AVX procedures where each interaction term is calculated using mainly single precision. We also discuss further strategies connected with coordinate and velocity prediction required by the integration scheme. This leaves hard binaries and multiple close encounters which are treated by several regularization methods. The present nbody6gpu code is well balanced for simulations in the particle range 1042 x 105 for a dual-GPU system attached to a standard PC.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available