2024 Impact factor 2.8
Hadrons and Nuclei

EPJ B Colloquium: Large scale simulations on GPU clusters

Optimal partitioning for 1024 processors of an irregular domain representing a full coronary tree

Graphics Processing Units (GPU) are currently used as a cost-effective platform for computer simulations and big-data processing. Large scale applications require that multiple GPUs work together, but the efficiency obtained with cluster of GPUs is, at times, suboptimal because the GPU features are not exploited at their best.

In this EPJ B Colloquium, Massimo Bernaschi and colleagues describe how it is possible to achieve an excellent efficiency for applications in statistical mechanics, particle dynamics and networks analysis by using suitable memory access patterns and mechanisms like CUDA streams, profiling tools, etc. Similar concepts and techniques may be applied also to other problems like the solution of Partial Differential Equations.

Editors-in-Chief
David Blaschke, Silvia Leoni and Dario Vretenar
We express our heartfelt thanks for the valuable suggestions, which helped us for improving our manuscript.

K. P. Santhosh School of Pure and Applied Physics, Kannur University, Payyanur, India

ISSN (Electronic Edition): 1434-601X

© Società Italiana di Fisica and
Springer-Verlag