Adaptive SIMD optimizations in particle-in-cell codes with fine-grain particle sorting
Abstract
Particle-In-Cell (PIC) codes are broadly applied to the kinetic simulation of plasmas, from laser-matter interaction to astrophysics. Their heavy simulation cost can be mitigated by using the Single Instruction Multiple Data (SIMD) capability, or vectorization, now available on most architectures. This article details and discusses the vectorization strategy developed in the code SMILEI, which takes advantage of an efficient, systematic, cell-based sorting of the particles. The PIC operators on particles (projection, push, deposition) have been optimized to benefit from large SIMD vectors on both recent and older architectures. The efficiency of these vectorized operations increases with the number of particles per cell (PPC), typically speeding up three-dimensional simulations by a factor of 2 with 256 PPC. Although this implementation shows acceleration from as few as 8 PPC, it can be slower than the scalar version in domains containing fewer PPC, as is usually observed in vectorization attempts. This issue is overcome with an adaptive algorithm which switches locally between scalar operators (for few PPC) and vectorized operators (otherwise). The newly implemented methods are benchmarked on three different, large-scale simulations considering configurations frequently studied with PIC codes.
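The two ideas in the abstract, cell-based sorting and a local switch between scalar and vectorized operators, can be illustrated with a minimal sketch. This is not SMILEI's implementation. The function names, the use of a counting sort, the decision based on the mean PPC of a region, and the threshold of 8 (borrowed from the "as few as 8 PPC" figure quoted above) are all assumptions made for illustration.

```python
# Illustrative sketch only; names and threshold are hypothetical, not SMILEI's API.

PPC_THRESHOLD = 8  # assumption: taken from the "as few as 8 PPC" figure in the abstract


def sort_by_cell(cell_index, n_cells):
    """Counting sort of particles by cell index.

    Returns (order, offsets): `order` lists particle ids grouped by cell,
    and particles of cell c occupy order[offsets[c]:offsets[c + 1]].
    """
    # Count the particles in each cell.
    counts = [0] * n_cells
    for c in cell_index:
        counts[c] += 1

    # An exclusive prefix sum gives the first slot of each cell.
    offsets = [0] * (n_cells + 1)
    for c in range(n_cells):
        offsets[c + 1] = offsets[c] + counts[c]

    # Scatter each particle id into the next free slot of its cell.
    cursor = offsets[:-1].copy()
    order = [0] * len(cell_index)
    for particle, c in enumerate(cell_index):
        order[cursor[c]] = particle
        cursor[c] += 1
    return order, offsets


def choose_operator(n_particles, n_cells, threshold=PPC_THRESHOLD):
    """Pick the operator flavour for one region from its mean PPC."""
    mean_ppc = n_particles / n_cells
    return "vectorized" if mean_ppc >= threshold else "scalar"
```

Once particles are grouped by cell, every particle in a contiguous slice touches the same grid nodes, which is the property that lets the operators work on long SIMD vectors. The switch then falls back to scalar operators in regions where those slices would be too short to pay off.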
- Publication: Computer Physics Communications
- Pub Date: November 2019
- DOI: 10.1016/j.cpc.2019.05.001
- arXiv: arXiv:1810.03949
- Bibcode: 2019CoPhC.244..246B
- Keywords: Particle-in-cell (PIC); Sorting; SIMD vectorization; Plasma physics; Physics - Computational Physics