Spectral envelope recovery beyond the Nyquist limit for high-quality manipulation of speech sounds

Hideki Kawahara, Masanori Morise, Hideki Banno, Toru Takahashi, Ryuichi Nisimura, Toshio Irino

Research output: Contribution to journalConference articlepeer-review

4 Citations (Scopus)

Abstract

A simple new method to recover details in a spectral envelope is proposed based on a recently introduced speech analysis, modification and resynthesis framework called TANDEM-STRAIGHT. Spectral envelope recovery of voiced sounds is a discrete-to-analog conversion in the frequency domain. However, there is a fundamental problem because the spatial frequency contents of vocal tract functions generally exceed the Nyquist limit of the equivalent sampling rate determined by the fundamental frequency. TANDEM-STRAIGHT yields a method to recover a spectral envelope based on the consistent sampling theory and provides base information for exceeding this limit. At the final stage, the AR spectral envelope estimated from the TANDEM-STRAIGHT spectrum is divided by the F0 adaptively smoothed version of itself to supply the missing high-spatial- frequency details of the envelope. The underlying principle of the proposed method can also be applied to other speech synthesis frameworks.

Original languageEnglish
Pages (from-to)650-653
Number of pages4
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - 2008
EventINTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association - Brisbane, QLD, Australia
Duration: 22 Sep 200826 Sep 2008

Keywords

  • Sampling theory
  • Speech analysis
  • Speech modification

Fingerprint

Dive into the research topics of 'Spectral envelope recovery beyond the Nyquist limit for high-quality manipulation of speech sounds'. Together they form a unique fingerprint.

Cite this