Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation

Hideki Kawahara, Masanori Morise, Tomoki Toda, Hideki Banno, Ryuichi Nisimura, Toshio Irino

Research output: Contribution to journalConference articlepeer-review

5 Citations (Scopus)

Abstract

A group delay-based excitation source analysis and design method is introduced for extension of TANDEM-STRAIGHT, a speech analysis, modification and synthesis system. This introduction makes all components of the system be based on interference-free representations. They are power spectrum, instantaneous frequency and group delay representations. This unification has potential to solve the major weak point of VOCODER architecture for high-quality speech manipulation applications.

Original languageEnglish
Pages (from-to)2243-2247
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - 1 Jan 2014
Event15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014 - Singapore, Singapore
Duration: 14 Sep 201418 Sep 2014

Keywords

  • Interference-free representations
  • Speech analysis
  • Speech modification
  • Speech synthesis
  • VOCODER

Fingerprint

Dive into the research topics of 'Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation'. Together they form a unique fingerprint.

Cite this