4.6 Article Proceedings Paper

A 6 mW, 5,000-Word Real-Time Speech Recognizer Using WFST Models

Journal

IEEE JOURNAL OF SOLID-STATE CIRCUITS
Volume 50, Issue 1, Pages 102-112

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSSC.2014.2367818

Keywords

CMOS digital integrated circuits; speech recognition; weighted finite-state transducers (WFST); Gaussian mixture models (GMM); low-power electronics

Funding

  1. Quanta Computer (via the Qmulus Project)
  2. Irwin and Joan Jacobs fellowship

Ask authors/readers for more resources

We describe an IC that provides a local speech recognition capability for a variety of electronic devices. We start with a generic speech decoder architecture that is programmable with industry-standard WFST and GMM speech models. Algorithm and architectural enhancements are incorporated in order to achieve real-time performance amid system-level constraints on internal memory size and external memory bandwidth. A 2.5 x 2.5 mm test chip implementing this architecture was fabricated using a 65 nm process. The chip performs a 5,000 word recognition task in real-time with 13.0% word error rate, 6.0 mW core power consumption, and a search efficiency of approximately 16 nJ per hypothesis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available