4.6 Article

A Cost-Efficient High-Speed VLSI Architecture for Spiking Convolutional Neural Network Inference Using Time-Step Binary Spike Maps

期刊

SENSORS
卷 21, 期 18, 页码 -

出版社

MDPI
DOI: 10.3390/s21186006

关键词

neuromorphic computing; spiking convolutional neural networks; SNN hardware; VLSI implementation; pixel stream processing

资金

  1. Key Project of Chongqing Science and Technology Foundation [cstc2019jcyj-zdxmX0017]
  2. Chongqing Talents Plan for Yong Talents [CQYC201905015]
  3. State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences [CARCH201908]

向作者/读者索取更多资源

Neuromorphic hardware systems utilizing spiking neural networks have gained attention for embedded applications due to their brain-inspired, energy-efficient design. This paper introduces a novel VLSI architecture for accelerating deep SCNN inference in real-time low-cost embedded scenarios, achieving high processing speed and recognition accuracies on image datasets. The architecture leverages snapshot processing and fine-grained data pipelines to achieve high throughput, demonstrating the feasibility of SCNN hardware for various embedded applications.
Neuromorphic hardware systems have been gaining ever-increasing focus in many embedded applications as they use a brain-inspired, energy-efficient spiking neural network (SNN) model that closely mimics the human cortex mechanism by communicating and processing sensory information via spatiotemporally sparse spikes. In this paper, we fully leverage the characteristics of spiking convolution neural network (SCNN), and propose a scalable, cost-efficient, and high-speed VLSI architecture to accelerate deep SCNN inference for real-time low-cost embedded scenarios. We leverage the snapshot of binary spike maps at each time-step, to decompose the SCNN operations into a series of regular and simple time-step CNN-like processing to reduce hardware resource consumption. Moreover, our hardware architecture achieves high throughput by employing a pixel stream processing mechanism and fine-grained data pipelines. Our Zynq-7045 FPGA prototype reached a high processing speed of 1250 frames/s and high recognition accuracies on the MNIST and Fashion-MNIST image datasets, demonstrating the plausibility of our SCNN hardware architecture for many embedded applications.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据