4.5 Article

Device-Circuit-Architecture Co-Exploration for Computing-in-Memory Neural Accelerators

期刊

IEEE TRANSACTIONS ON COMPUTERS
卷 70, 期 4, 页码 595-605

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TC.2020.2991575

关键词

Computer architecture; Hardware; Neural networks; Performance evaluation; Optimization; Object recognition; Quantization (signal); Hardware; software co-design; computing-in-memory architecture; neural architecture search; neural network accelerator

资金

  1. US National Science Foundation [CNS-1822099, SPX-1919167, CNS-1629914, CCF-1820537]

向作者/读者索取更多资源

The article discusses the co-exploration of neural architectures and hardware design, proposing the NACIM framework, which can find robust neural networks and achieve high energy-efficiency performance while considering device variation.
Co-exploration of neural architectures and hardware design is promising due to its capability to simultaneously optimize network accuracy and hardware efficiency. However, state-of-the-art neural architecture search algorithms for the co-exploration are dedicated for the conventional von-Neumann computing architecture, whose performance is heavily limited by the well-known memory wall. In this article, we are the first to bring the computing-in-memory architecture, which can easily transcend the memory wall, to interplay with the neural architecture search, aiming to find the most efficient neural architectures with high network accuracy and maximized hardware efficiency. Such a novel combination makes opportunities to boost performance, but also brings a bunch of challenges: The optimization space spans across multiple design layers from device type and circuit topology to neural architecture; and the presence of device variation may drastically degrade the neural network performance. To address these challenges, we propose a cross-layer exploration framework, namely NACIM, which jointly explores device, circuit and architecture design space and takes device variation into consideration to find the most robust neural architectures, coupled with the most efficient hardware design. Experimental results demonstrate that NACIM can find the robust neural network with 0.45 percent accuracy loss in the presence of device variation, compared with a 76.44 percent loss from the state-of-the-art NAS without consideration of variation; in addition, NACIM achieves an energy efficiency up to 16.3 TOPs/W, 3.17x higher than the state-of-the-art NAS.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据