Fifteenth International Workshop on
Parallel Programming Models and Systems Software for High-End Computing (P2S2), 2022 To be held in conjunction with ICPP 2022: The 51st International Conference on Parallel Processing August 29th to Sept 1st, 2022 in Bordeaux, France |
||
Abstract - Analysis and optimization of data transfer in Multi-GPU Python applications
Python is becoming increasingly popular, even in parallel and high-performance computing, although performance is often worse than compiled languages. In our previous work, we looked at Cuda Numba and showed that it can achieve good performance for single kernels. We are now looking at multi-GPU applications that require data exchange. We show that by using stream-aware communication, as enabled by NCCL, performance can be many times better than with MPI, where the Python interpreter poses a significant performance problem.
Biography - Lena Oden
Lena Oden is a professor for Computer Engeneering at the FernUniversität in Hagen and scientist at the Jülich Supercomuting Center.
Her research interests are programming models and runtime systems, with a special interest in (multi-) GPU computing and the design and implementation of federated research infrastructures, combining the benefits of cloud and HPC computing.