| Title: |
Performance and Programmability of MPI+X Integration with CUDA, HIP, SYCL, OpenACC, and OpenMP Offloading for Supercomputing: A Case Study on Dense Matrix–Vector Multiplication |
| Authors: |
Krishnasamy, Ezhilmathi; Throtter, James; Cai, Xing; Pleiter, Dirk; Kos, Leon; Saavedra, Laura; Bouvry, Pascal |
| Source: |
Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region Workshops. :457-468 |
| Availability: |
http://dl.acm.org/doi/10.1145/3784828.3786264 |
| Database: |
ACM Full-Text Collection |