4.6 Article

A Resource-Efficient Keyword Spotting System Based on a One-Dimensional Binary Convolutional Neural Network

期刊

ELECTRONICS
卷 12, 期 18, 页码 -

出版社

MDPI
DOI: 10.3390/electronics12183964

关键词

keyword spotting; convolutional neural networks; binarized neural networks; inference; processor; field-programmable gate arrays

向作者/读者索取更多资源

This paper proposes a resource-efficient keyword spotting system based on a convolutional neural network. The system utilizes a one-dimensional CNN for end-to-end keyword recognition, achieving high accuracy and fast processing speed. By binarizing the model and employing a dedicated engine, the system reduces resource usage without sacrificing performance. Experimental results show that the system achieves a processing latency of 22 ms and a spotting accuracy of 91.80% in a specific environment.
This paper proposes a resource-efficient keyword spotting (KWS) system based on a convolutional neural network (CNN). The end-to-end KWS process is performed based solely on 1D-CNN inference, where features are first extracted from a few convolutional blocks, and then the keywords are classified using a few fully connected blocks. The 1D-CNN model is binarized to reduce resource usage, and its inference is executed by employing a dedicated engine. This engine is designed to skip redundant operations, enabling high inference speed despite its low complexity. The proposed system is implemented using 6895 ALUTs in an Intel Cyclone V FPGA by integrating the essential components for performing the KWS process. In the system, the latency required to process a frame is 22 ms, and the spotting accuracy is 91.80% in an environment where the signal-to-noise ratio is 10 dB for Google speech commands dataset version 2.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据