Naigang Wang

Title

RSM, Manager, AI acceleration algorithm and framework

Publications

DropKV: Decoupling Residual-Output Perturbation for Near-Optimal KV-Cache Eviction
- - Aozhong Zhang
  - Selcuk Gurses
  - et al.
- 2026
- ICML 2026
Workshop paper
Is Finer Better? The Limits of Microscaling Formats in Large Language Models
- - Andrea Fasoli
  - Monodeep Kar
  - et al.
- 2026
- ICLR 2026
Conference paper
Frayed RoPE and Long Inputs: A Geometric Perspective
- - Davis Wertheimer
  - Aozhong Zhang
  - et al.
- 2026
- ICLR 2026
Conference paper
Universal Position Interpolation: Unified Context Scaling for Hybrid Mamba-Transformer Models
- - Haochen Shen
  - Davis Wertheimer
  - et al.
- 2026
- ICLR 2026
Conference paper
Advancing Fluorescence Light Detection and Ranging in Scattering Media with a Physics-Guided Mixture-of-Experts and Evidential Critics
- - Ismail Erbas
  - Ferhat Demikiran
  - et al.
- 2025
- NeurIPS 2025
Workshop paper
Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System
- - Yunhua Fang
  - Rui Xie
  - et al.
- 2025
- IEEE Computer Architecture Letters
Paper
CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization
- - Yanxia Deng
  - Aozhong Zhang
  - et al.
- 2025
- TMLR
Paper
Generative AI Through CAS Lens: An Integrated Overview of Algorithmic Optimizations, Architectural Advances, and Automated Designs
- - Chuan Zhang
  - You You
  - et al.
- 2025
- IEEE JESTCS
Paper
Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization
- - Wei Liu
  - Anweshit Panda
  - et al.
- 2025
- TMLR
Paper
COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization
- - Aozhong Zhang
  - Zi Yang
  - et al.
- 2025
- IEEE Access
Paper

Blog posts

Ultra-low-precision training of deep neural networks
Technical note
Naigang Wang
09 May 2019
- AI
8-bit precision for training deep learning systems
Research
Naigang Wang
03 Dec 2018
- AI
- AI Hardware

Top collaborators

Derrick Liu

Software Developer

Kaoutar El Maghraoui

Principal Research Scientist, AIU Spyre Software Ecosystem, AI Hardware Center

Raghu Kiran Ganti

Distinguished Engineer

Mudhakar Srivatsa

Distinguished Engineer, AI Platform

Naigang Wang

Title

Publications

DropKV: Decoupling Residual-Output Perturbation for Near-Optimal KV-Cache Eviction

Is Finer Better? The Limits of Microscaling Formats in Large Language Models

Frayed RoPE and Long Inputs: A Geometric Perspective

Universal Position Interpolation: Unified Context Scaling for Hybrid Mamba-Transformer Models

Advancing Fluorescence Light Detection and Ranging in Scattering Media with a Physics-Guided Mixture-of-Experts and Evidential Critics

Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System

CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization

Generative AI Through CAS Lens: An Integrated Overview of Algorithmic Optimizations, Architectural Advances, and Automated Designs

Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization

COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization

Patents

Very Low Precision Floating Point Representation For Deep Learning Acceleration

Mixed Precision Capable Hardware For Tuning A Machine Learning Model

Magnetic Inductor Stacks With Multilayer Isolation Layers

Laminated Magnetic Inductor Stack With High Frequency Peak Quality Factor

Stress Management For Thick Magnetic Film Inductors

Magnetic Inductor With Multiple Magnetic Layer Thicknesses

Providing Supply Voltage To A Dynamic Internal Power Supply Node

Planar Solenoid Inductors With Antiferromagnetic Pinned Cores

Resonant Clock Circuit With Magnetic Shield