Article

WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications

Journal

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Volume E99D, Issue 7, Pages 1877-1884

Publisher

IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG
DOI: 10.1587/transinf.2015EDP7457

Keywords

speech analysis; speech synthesis; vocoder; sound quality; real-time processing

Funding

  1. JSPS KAKENHI [15H02726, 26540087]
  2. Research Institute of Electrical Communication, Tohoku University [H25/A08]
  3. Grants-in-Aid for Scientific Research [26540087, 15H02726] Funding Source: KAKEN

Abstract

A vocoder-based speech synthesis system, named WORLD, was developed in an effort to improve the sound quality of real-time applications using speech. Speech analysis, manipulation, and synthesis on the basis of vocoders are used in various kinds of speech research. Although several high-quality speech synthesis systems have been developed, real-time processing has been difficult with them because of their high computational costs. The new system offers both high sound quality and fast processing. It consists of three analysis algorithms and one synthesis algorithm proposed in our previous research. The effectiveness of the system was evaluated by comparing its output against natural speech including consonants. Its processing speed was also compared with that of conventional systems. The results showed that WORLD was superior to the other systems in terms of both sound quality and processing speed. In particular, it was over ten times faster than the conventional systems, and the real-time factor (RTF) indicated that it was fast enough for real-time processing.
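
The real-time factor mentioned above is commonly defined as processing time divided by the duration of the audio processed, so a value below 1 means the system keeps up with real time. The following is a minimal sketch of that calculation, not the authors' measurement code; the function names `real_time_factor` and `measure_rtf` and the `process` callable are hypothetical and stand in for any analysis or synthesis routine.

```python
import time


def real_time_factor(processing_seconds, audio_seconds):
    """Return processing time divided by audio duration (RTF).

    An RTF below 1.0 means processing is faster than real time.
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def measure_rtf(process, samples, sample_rate):
    """Time a hypothetical `process` callable on `samples` and return its RTF.

    `process` is a placeholder for any analysis/synthesis function; it is an
    assumption of this sketch and not part of any particular library.
    """
    start = time.perf_counter()
    process(samples)
    elapsed = time.perf_counter() - start
    audio_seconds = len(samples) / sample_rate
    return real_time_factor(elapsed, audio_seconds)


if __name__ == "__main__":
    # One second of silence at 16 kHz, processed by a trivial stand-in.
    one_second = [0.0] * 16000
    rtf = measure_rtf(lambda s: sum(s), one_second, 16000)
    print(f"RTF: {rtf:.6f}")
```

For instance, a system that needs 0.5 s to process 2 s of speech has an RTF of 0.25 under this definition.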
