Fifteenth International Workshop on
Parallel Programming Models and Systems Software for
High-End Computing (P2S2), 2022

To be held in conjunction with
ICPP 2022: The 51st International Conference on Parallel Processing
August 29th to Sept 1st, 2022 in Bordeaux, France

Abstract - Analysis and optimization of data transfer in Multi-GPU Python applications

Python is becoming increasingly popular, even in parallel and high-performance computing, although performance is often worse than compiled languages. In our previous work, we looked at Cuda Numba and showed that it can achieve good performance for single kernels. We are now looking at multi-GPU applications that require data exchange. We show that by using stream-aware communication, as enabled by NCCL, performance can be many times better than with MPI, where the Python interpreter poses a significant performance problem.

Biography - Lena Oden

Lena Oden is a professor for Computer Engeneering at the FernUniversität in Hagen and scientist at the Jülich Supercomuting Center.

Her research interests are programming models and runtime systems, with a special interest in (multi-) GPU computing and the design and implementation of federated research infrastructures, combining the benefits of cloud and HPC computing.