Systems Two Phase Cooling
Timothy Chainer, Liz Hulihan, et al.
ARPA-E COOLERCHIPS Kickoff Meeting 2023
Kubernetes has become the de facto standard for orchestrating cloud workloads, but its traditional device plugin model struggles to keep pace with the growing diversity of hardware accelerators such as GPUs, DPUs, high-speed networking devices, and emerging AI chips. Static allocation limits flexibility, resource efficiency, and multi-tenancy. This talk introduces Dynamic Resource Allocation (DRA)—a groundbreaking approach that enables fine-grained, on-demand allocation and sharing of devices across workloads, with topology-aware scheduling to optimize performance for complex hardware interconnects. We will dive into the architecture and design principles behind DRA, showcase real-world use cases, and discuss its implications for Telco, HPC and AI. Attendees will learn how DRA can unlock better utilization, scalability, and sustainability in cloud-native environments.
Timothy Chainer, Liz Hulihan, et al.
ARPA-E COOLERCHIPS Kickoff Meeting 2023
Robert Tracey, Mobayode Akinsolu, et al.
SC 2022
Robert Tracey, Ngoc Lan Hoang, et al.
ISC 2020
Claudia Misale, Daniel Milroy
KubeCon + CloudNativeCon EU 2022