Analog AI as a Service: A Cloud Platform for In-Memory Computing
Kaoutar El Maghraoui, Kim Tran, et al.
SSE 2024
Analogue in-memory computing (AIMC) is an emerging computational paradigm that can efficiently accelerate the key operations in deep learning (DL) inference workloads. Heterogeneous architectures, which integrate both AIMC tiles and digital processing units, have been proposed to enable the end-to-end execution of various deep neural network models. However, developing a software stack for these architectures is challenging, owing to their distinct characteristics — such as the need for extensive or complete weight stationarity and pipelined execution across layers, if maximum performance is to be achieved. Moreover, AIMC tiles are inherently stochastic and hence introduce a combination of stochastic and deterministic noise, which adversely affects accuracy. As a result, existing tools for software stack development are not directly applicable. In this Perspective, we give an overview of the key attributes of DL software stacks and AIMC-based accelerators, outline the challenges associated with designing DL software stacks for AIMC-based accelerators and present opportunities for future research.
Kaoutar El Maghraoui, Kim Tran, et al.
SSE 2024
Chander Govindarajan, Priyanka Naik, et al.
CLOUD 2024
Pritish Parida, Timothy Chainer, et al.
ARPA-E Summit 2023
Matteo Manica, Raphael Polig, et al.
IEEE/ACM TCBB