PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters
| Title: | PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters |
|---|---|
| Authors: | Jain, Rutwik; Tran, Brandon; Chen, Keting; Sinclair, Matthew D.; Venkataraman, Shivaram |
| Source: | Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis. :1-18 |
| Availability: | http://dl.acm.org/doi/10.1109/SC41406.2024.00032 |
| Database: | ACM Full-Text Collection |