AI hardware acceleration with analog memory: Microarchitectures for low energy at high speed

H. Y. Chang; Geoffrey W. Burr; Pritish Narayanan; Scott C. Lewis; N. C.P. Farinha; Kohji Hosokawa; Charles Mackin; Hsinyu Tsai; Stefano Ambrogio; An Chen

doi:10.1147/JRD.2019.2934050

IBM J. Res. Dev

Paper

01 Nov 2019

Abstract

In this article, we present innovative microarchitectural designs for multilayer deep neural networks (DNNs) implemented in crossbar arrays of analog memories. Data is transferred in a fully parallel manner between arrays without explicit analog-to-digital converters. Design ideas including source follower-based readout, array segmentation, and transmit-by-duration are adopted to improve the circuit efficiency. The execution energy and throughput, for both DNN training and inference, are analyzed quantitatively using circuit simulations of a full CMOS design in the 90-nm technology node. We find that our current design could achieve up to 12-14 TOPs/s/W energy efficiency for training, while a projected scaled design could achieve up to 250 TOPs/s/W. Key challenges in realizing analog AI systems are discussed.
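As a rough illustration of the crossbar multiply-accumulate and the transmit-by-duration idea named in the abstract, the sketch below encodes each input as a pulse duration and sums the charge collected on each column, using Ohm's law and current summation on shared column wires. It is not the authors' circuit: the function names, the read voltage `v_read`, the maximum pulse width `t_max`, and the conductance values are all hypothetical, and signed weights, device nonlinearity, and the readout circuitry are ignored.

```python
def encode_duration(x, t_max=100e-9):
    """Map an activation in [0, 1] to a pulse duration in seconds.

    t_max is a hypothetical maximum pulse width, not a value from the paper.
    """
    if not 0.0 <= x <= 1.0:
        raise ValueError("activation must lie in [0, 1]")
    return x * t_max


def crossbar_mac(conductances, durations, v_read=0.2):
    """Charge collected on each column of an idealized crossbar.

    conductances[i][j] is the device conductance (siemens) at row i, column j.
    durations[i] is how long row i is driven at v_read (volts, assumed value).
    Each device passes a current G * v_read for durations[i] seconds, and the
    currents add on the shared column wire, so
        Q_j = v_read * sum_i conductances[i][j] * durations[i].
    """
    n_rows = len(conductances)
    n_cols = len(conductances[0])
    if len(durations) != n_rows:
        raise ValueError("need one duration per crossbar row")
    charges = [0.0] * n_cols
    for i in range(n_rows):
        for j in range(n_cols):
            charges[j] += v_read * conductances[i][j] * durations[i]
    return charges


if __name__ == "__main__":
    # Hypothetical 2x2 array with conductances in the microsiemens range.
    g = [[1e-6, 2e-6],
         [3e-6, 4e-6]]
    pulses = [encode_duration(0.5), encode_duration(1.0)]
    print(crossbar_mac(g, pulses))
```

Because every column integrates its charge at the same time, one pass over the array yields a full vector-matrix product, which is the parallelism the abstract relies on.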

Paper

Noise reduction of page-oriented data storage by inverse filtering during recording

Geoffrey W. Burr, Hans Coufal, et al.

Optics Letters

Paper

Nanoscale nuclei in phase change materials: Origin of different crystallization mechanisms of Ge₂Sb₂Te₅ and AgInSbTe

Bong-Sub Lee, Robert M. Shelby, et al.

Journal of Applied Physics

Conference paper

Improved deep neural network hardware-accelerators based on non-volatile-memory: The local gains technique

Irem Boybat, Carmelo Di Nolfo, et al.

ICRC 2017

Paper

On the origin of steep I-V nonlinearity in mixed-ionic-electronic-conduction-based access devices

Alvaro Padilla, Geoffrey W. Burr, et al.

IEEE T-ED
