3.8 Proceedings Paper

Trends in Analog and Digital Intensive Compute-in-SRAM Designs

With the unprecedented growth in Deep Neural Networks (DNN) model size, researchers are exploring Compute-In-Memory (CIM) designs using Static Access Random Memory (SRAM) to perform DNN computations within memory arrays in order to mitigate latency and energy overheads. These designs can realize analog or digital domain multiply-and-accumulate (MAC) computations using modulated wordline and bitline voltages and pulse-widths. Key design trends and qualitative comparisons in recent CIM-SRAM designs utilizing analog and digitally-intensive approaches are presented in this paper.
The unprecedented growth in Deep Neural Networks (DNN) model size has resulted into a massive amount of data movement from off-chip memory to on-chip processing cores in modern Machine Learning (ML) accelerators. Compute-In-Memory (CIM) designs performing DNN computations within memory arrays are being explored to mitigate this 'Memory Wall' bottleneck of latency and energy overheads. Among the incumbent embedded memories, the Static Access Random Memory (SRAM) built using high performance logic transistors and interconnects can enable custom CIM designs while offering low pJ/bit access energy, high-endurance, high-performance, and high- bandwidth. The wordline and bitline voltages and pulse-widths are modulated to realize analog or digital domain multiply-and-accumulate (MAC) computations using multiple SRAM bitcell variants. This paper describes the trends in recent CIM-SRAM designs utilizing such analog and digitally-intensive approaches. In an analog CIM-SRAM design,the inputs/activations are transformed into analog voltage or pulsewidth and applied on wordlines and/or bitlines. Multi-bit MAC computations often involve peripheral data converter circuits which need to be optimized significantly to minimize the area and energy overheads. On the other hand, digitally-intensive CIM-SRAM approaches try to avoid analog circuits by implementing smaller bit-width wordline/bitline computations and utilize sense amplifiers for performing basic logic operations and/or employ small digital logic block next to the SRAM column I/O circuits forming compute-in/near SRAM designs. Key design trends in both the approaches and qualitative comparisons are presented with a perspective on future CIM-SRAM designs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available