Neptune: Advanced ML Operator Fusion for Locality and Parallelism on GPUsYifan ZhaoEgan Johnsonet al.2026PLDI 2026Conference paper
Agile Software-Hardware Co-Design of AI-Centric Heterogeneous SoCsSarita AdveVikram Adveet al.2024ISCA 2024Tutorial