Swagath Venkataramani

Title

Principal Research Scientist, AIU Architecture and Compilers

Publications

Is Finer Better? The Limits of Microscaling Formats in Large Language Models
- - Andrea Fasoli
  - Monodeep Kar
  - et al.
- 2026
- ICLR 2026
Conference paper
Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators
- - Prasanth Chatarasi
  - Alex Gatea
  - et al.
- 2026
- CGO 2026
Conference paper
Spyre: An inference-optimized scalable AI accelerator for enterprise workloads
- - Matt Cohen
  - Monodeep Kar
  - et al.
- 2026
- ISSCC 2026
Conference paper
Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization
- - Prasanth Chatarasi
  - Alex Gatea
  - et al.
- 2026
- CGO 2026
Conference paper
DeepTools: A Full-Stack Machine Learning Compiler for the IBM Spyre Accelerator
- - Prasanth Chatarasi
  - Shubham Jain
  - et al.
- 2026
- CGO 2026
Workshop paper
Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure
- - Rui Xie
  - Asad Ul Haq
  - et al.
- 2025
- IEEE Computer Architecture Letters
Paper
MixTrain: accelerating DNN training via input mixing
- - Sarada Krithivasan
  - Sanchari Sen
  - et al.
- 2024
- Frontiers in Artificial Intelligence
Paper
A Software-Assisted Peak Current Regulation Scheme to Improve Power-Limited Inference Performance in a 5nm AI SoC
- - Monodeep Kar
  - Joel Silberman
  - et al.
- 2024
- ISSCC 2024
Conference paper
DNNDaSher: A Compiler Framework for Dataflow Compatible End-to-End Acceleration on IBM AIU
- - Sanchari Sen
  - Shubham Jain
  - et al.
- 2024
- IEEE Micro
Paper
Power-Limited Inference Performance Optimization Using a Software-Assisted Peak Current Regulation Scheme in a 5-nm AI SoC
- - Monodeep Kar
  - Joel Silberman
  - et al.
- 2024
- IEEE Journal of Solid-State Circuits
Paper

Top collaborators

Alberto Mannari

Software Developer

Prasanth Chatarasi

Senior Research Scientist, AI Accelerator Compilers and Architecture

Matthew Ziegler

Principal Research Scientist

Paul G Crumley

STSM, AI & Hybrid Cloud Infrastructure

Swagath Venkataramani

Title

Publications

Is Finer Better? The Limits of Microscaling Formats in Large Language Models

Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators

Spyre: An inference-optimized scalable AI accelerator for enterprise workloads

Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization

DeepTools: A Full-Stack Machine Learning Compiler for the IBM Spyre Accelerator

Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure

MixTrain: accelerating DNN training via input mixing

A Software-Assisted Peak Current Regulation Scheme to Improve Power-Limited Inference Performance in a 5nm AI SoC

DNNDaSher: A Compiler Framework for Dataflow Compatible End-to-End Acceleration on IBM AIU

Power-Limited Inference Performance Optimization Using a Software-Assisted Peak Current Regulation Scheme in a 5-nm AI SoC

Patents

Reformatting Of Tensors To Provide Sub-tensors

Sparse Systolic Array Design

Exploiting Fine-grained Structured Weight Sparsity In Systolic Arrays

Programmable Data Delivery To A System Of Shared Processing Elements With Shared Memory

Reducing The Cost Of N Modular Redundancy For Neural Networks

Hybrid Data-model Parallelism For Efficient Deep Learning

System-aware Selective Quantization For Performance Optimized Distributed Deep Learning

Low Precision Deep Neural Network Enabled By Compensation Instructions