Publications

5 results for Eun K. Lee

NetZIP: Algorithm/Hardware Co-design of In-network Lossless Compression for Distributed Large Model Training
- - Jinghan Huang
  - Hyungyo Kim
  - et al.
- 2025
- MICRO 2025
Conference paper

Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference
- - Pol G. Recasens
  - Ferran Agullo
  - et al.
- 2025
- CLOUD 2025
Conference paper

Towards Pareto Optimal Throughput in Small Language Model Serving
- - Pol G. Recasens
  - Yue Zhu
  - et al.
- 2024
- EuroMLSys 2024
Conference paper

Characterizing Training Performance and Energy for Foundation Models and Image Classifiers on Multi-Instance GPUs
- - Connor Espenshade
  - Rachel Peng
  - et al.
- 2024
- EuroMLSys 2024
Conference paper

Reducing Datacenter Compute Carbon Footprint by Harnessing the Power of Specialization: Principles, Metrics, Challenges and Opportunities
- - Tamar Eilam
  - Pradip Bose
  - et al.
- 2024
- IEEE Trans Semicond Manuf
Paper