Article

A four-megabit compute-in-memory macro with eight-bit precision based on CMOS and resistive random-access memory for AI edge devices

NATURE ELECTRONICS (2021)

Journal

NATURE ELECTRONICS

Volume 4, Issue 12, Pages 921-+

Publisher

NATURE PORTFOLIO

DOI: 10.1038/s41928-021-00676-9

Category

Engineering, Electrical & Electronic

Funding

NVM-DTP of TSMC
TSMC-NTHU JDP
CR of TSMC
NTHU
MOST-Taiwan

Summary

Advanced complementary metal-oxide-semiconductor technology and resistive random-access memory have been used to create a high-bit-precision compute-in-memory macro for efficient edge computing. The non-volatile computing-in-memory architecture reduces the latency and energy consumption of artificial intelligence computation. The macro offers low latency and high energy efficiency for binary to 8-bit-input-8-bit-weight dot-product operations.

Abstract

Advanced complementary metal-oxide-semiconductor technology and resistive random-access memory can be used to create high-bit-precision compute-in-memory macros for low latency and efficient edge computing. Non-volatile computing-in-memory (nvCIM) architecture can reduce the latency and energy consumption of artificial intelligence computation by minimizing the movement of data between the processor and memory. However, artificial intelligence edge devices with high inference accuracy require large-capacity nvCIM macros capable of high-bit-precision dot-product operations. Here we report a four-megabit nvCIM macro that combines memory cells with peripheral circuitry and is based on 22-nm-foundry binary resistive random-access memory devices and complementary metal-oxide-semiconductor (CMOS) processes. The fully CMOS-integrated macro features an asymmetrically modulated input-and-calibration scheme, a calibrated-and-weighted current-to-voltage stacking read scheme, and input-shaping hardware to overcome the challenges involved in designing large-capacity nvCIM macros with high bit precision. The macro offers latencies between 5.2 and 15.2 ns and energy efficiency between 194.4 and 15.6 tera-operations per second per watt in binary to 8-bit-input-8-bit-weight dot-product operations.
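The abstract notes that the macro is built from binary memory devices yet supports dot products of up to 8-bit inputs and 8-bit weights. One generic way to see how multi-bit results can be assembled from binary operations is bit-plane decomposition, sketched below in Python. This is only an illustration under stated assumptions: the function names (`bit_planes`, `bit_sliced_dot`) are hypothetical, the values are treated as unsigned integers, and the code does not model the paper's input-and-calibration or current-to-voltage read schemes.

```python
def bit_planes(values, bits=8):
    """Split unsigned integers into `bits` binary planes, least significant first."""
    for v in values:
        if not 0 <= v < (1 << bits):
            raise ValueError(f"{v} does not fit in {bits} unsigned bits")
    return [[(v >> b) & 1 for v in values] for b in range(bits)]


def bit_sliced_dot(inputs, weights, bits=8):
    """Dot product assembled from 1-bit x 1-bit partial sums (assumed unsigned)."""
    if len(inputs) != len(weights):
        raise ValueError("inputs and weights must have the same length")
    input_planes = bit_planes(inputs, bits)
    weight_planes = bit_planes(weights, bits)
    total = 0
    for i, in_plane in enumerate(input_planes):
        for j, w_plane in enumerate(weight_planes):
            # Binary dot product: counts positions where both bits are 1.
            partial = sum(x & w for x, w in zip(in_plane, w_plane))
            # Weight the partial sum by the significance of both bit positions.
            total += partial << (i + j)
    return total


# Illustrative values only; they are not taken from the paper.
example_inputs = [3, 255, 16, 7]
example_weights = [200, 1, 9, 128]
reference = sum(x * w for x, w in zip(example_inputs, example_weights))
assert bit_sliced_dot(example_inputs, example_weights) == reference
```

Setting `bits=1` reduces the function to a plain binary dot product, which corresponds to the lowest-precision end of the range quoted in the abstract.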
