More workloads per GPU
Priority-aware fairness per workload
Predictable latency with no noisy neighbor impact