Transformer Explainer: Learning LLM Transformers with Interactive Visual Explanation and Experimentation

Aeree Cho; Grace Kim; Alexander Karpekov; Seongmin Lee; Alec Helbling; Benjamin Hoover; Zijie Wang; Minsuk Kang; Polo Chau

CHI 2026

Conference paper

13 Apr 2026

Abstract

The Transformer architecture underpins modern large language models powering state-of-the-art text generation and AI applications. However, its complexity makes it difficult for non-experts to learn. Existing resources often lack interactivity, rely on static descriptions of simplified architectures, or fail to reflect models’ behavior with real data. To address this gap, we introduce Transformer Explainer, an interactive visualization tool for non-experts to learn Transformers. The tool integrates an overview illustrating the Transformer’s data flow with on-demand explanations that gradually reveal mathematical details. Smooth transitions across abstraction levels highlight the interplay between high-level structures and low-level operations. Running a live GPT-2 instance directly in the browser, Transformer Explainer empowers learners to experiment with custom input and hyperparameters without setup, observing next-token predictions in real time. A 90-participant user study showed that our tool offered significant advantages in improving user understanding and engagement. Transformer Explainer has attracted over 490,000 users.

Talk

Dataset of Reticular Materials' Syntheses Automatically Created from PDFs by using LLMs

Viviane T. Silva, Rodrigo Neumann Barros Ferreira, et al.

ACS Fall 2024

Poster

Triton vs. Halide: Exploring Coupled and Decoupled Machine Learning Kernel Languages

Quinn Pham, Danila Seliayeu, et al.

CASCON 2024

Paper

PaccMann^RL: De novo generation of hit-like anticancer molecules from transcriptomic data via reinforcement learning

Jannis Born, Matteo Manica, et al.

iScience

Workshop poster

SubsetGAN: Pattern detection in the activation space for Identifying Synthesised Content

Celia Cintas, Skyler Speakman, et al.

ICML 2021
