A fully parallel, high precision, N-body code running on hybrid computing platforms
Abstract
We present a new implementation of the numerical integration of the classical, gravitational, N-body problem based on a high order Hermite's integration scheme with block time steps, with a direct evaluation of the particle-particle forces. The main innovation of this code (called HiGPUs) is its full parallelization, exploiting both OpenMP and MPI in the use of the multicore Central Processing Units as well as either Compute Unified Device Architecture (CUDA) or OpenCL for the hosted Graphic Processing Units. We tested both performance and accuracy of the code using up to 256 GPUs in the supercomputer IBM iDataPlex DX360M3 Linux Infiniband Cluster provided by the Italian supercomputing consortium CINECA, for values of N⩽8 millions. We were able to follow the evolution of a system of 8 million bodies for few crossing times, task previously unreached by direct summation codes.
- Publication:
-
Journal of Computational Physics
- Pub Date:
- March 2013
- DOI:
- 10.1016/j.jcp.2012.11.013
- arXiv:
- arXiv:1207.2367
- Bibcode:
- 2013JCoPh.236..580C
- Keywords:
-
- Astrophysics - Instrumentation and Methods for Astrophysics;
- Computer Science - Distributed;
- Parallel;
- and Cluster Computing;
- Physics - Computational Physics
- E-Print:
- Paper submitted to Journal of Computational Physics consisting in 28 pages, 9 figures.The previous submitted version was lacking of the bibliography, for a Tex problem