4.7 Article

Resistive Neural Hardware Accelerators

期刊

PROCEEDINGS OF THE IEEE
卷 111, 期 5, 页码 500-527

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JPROC.2023.3268092

关键词

Compute-in-memory (CIM); deep neural networks (DNNs); hardware acceleration; in-memory computing; processing-in-memory; resistive random access memory (ReRAM)

向作者/读者索取更多资源

Deep neural networks (DNNs) can learn real-world data and make real-time decisions, but are hindered by software and hardware limitations. Emerging nonvolatile memory (NVM) devices and the compute-in-memory (CIM) paradigm offer a new hardware architecture for efficient DNN acceleration. This survey reviews ReRAM-based DNN many-core accelerators, highlighting their superiority over CMOS counterparts and the need for new performance metrics and benchmarking standards.
Deep neural networks (DNNs), as a subset of machine learning (ML) techniques, entail that real-world data can be learned, and decisions can be made in real time. However, their wide adoption is hindered by a number of software and hardware limitations. The existing general-purpose hardware platforms used to accelerate DNNs are facing new challenges associated with the growing amount of data and are exponentially increasing the complexity of computations. Emerging nonvolatile memory (NVM) devices and the compute-in-memory (CIM) paradigm are creating a new hardware architecture generation with increased computing and storage capabilities. In particular, the shift toward resistive random access memory (ReRAM)-based in-memory computing has great potential in the implementation of area- and power-efficient inference and in training large-scale neural network architectures. These can accelerate the process of IoT-enabled AI technologies entering our daily lives. In this survey, we review the state-of-the-art ReRAM-based DNN many-core accelerators, and their superiority compared to CMOS counterparts was shown. The review covers different aspects of hardware and software realization of DNN accelerators, their present limitations, and prospects. In particular, a comparison of the accelerators shows the need for the introduction of new performance metrics and benchmarking standards. In addition, the major concerns regarding the efficient design of accelerators include a lack of accuracy in simulation tools for software and hardware codesign.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据