4.8 Article

A four-megabit compute-in-memory macro with eight-bit precision based on CMOS and resistive random-access memory for AI edge devices

期刊

NATURE ELECTRONICS
卷 4, 期 12, 页码 921-+

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41928-021-00676-9

关键词

-

资金

  1. NVM-DTP of TSMC
  2. TSMC-NTHU JDP
  3. CR of TSMC
  4. NTHU
  5. MOST-Taiwan

向作者/读者索取更多资源

Advanced complementary metal-oxide-semiconductor technology and resistive random-access memory have been used to create a high-bit-precision compute-in-memory macro for efficient edge computing. The non-volatile computing-in-memory architecture reduces latency and energy consumption of artificial intelligence computation. The macro offers low latency and high energy efficiency for binary to 8-bit-input-8-bit-weight dot-product operations.
Advanced complementary metal-oxide-semiconductor technology and resistive random-access memory can be used to create high-bit-precision compute-in-memory macros for low latency and efficient edge computing. Non-volatile computing-in-memory (nvCIM) architecture can reduce the latency and energy consumption of artificial intelligence computation by minimizing the movement of data between the processor and memory. However, artificial intelligence edge devices with high inference accuracy require large-capacity nvCIM macros capable of high-bit-precision dot-product operations. Here we report a four-megabit nvCIM macro that combines memory cells with peripheral circuitry and is based on 22-nm-foundry binary resistive random-access memory devices and complementary metal-oxide-semiconductor (CMOS) processes. The fully CMOS-integrated macro features an asymmetrically modulated input-and-calibration scheme, a calibrated-and-weighted current-to-voltage stacking read scheme, and input-shaping hardware to overcome the challenges involved in designing large-capacity nvCIM macros with high bit precision. The macro offers latencies between 5.2 and 15.2 ns and energy efficiency between 194.4 and 15.6 tera-operations per second per watt in binary to 8-bit-input-8-bit-weight dot-product operations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据