A modulation property of time-frequency derivatives of filtered phase and its application to aperiodicity and fo estimation

Hideki Kawahara, Ken Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda

Research output: Contribution to journalConference articlepeer-review

2 Citations (Scopus)

Abstract

We introduce a simple and linear SNR (strictly speaking, periodic to random power ratio) estimator (0 dB to 80 dB without additional calibration/linearization) for providing reliable descriptions of aperiodicity in speech corpus. The main idea of this method is to estimate the background random noise level without directly extracting the background noise. The proposed method is applicable to a wide variety of time windowing functions with very low sidelobe levels. The estimate combines the frequency derivative and the time-frequency derivative of the mapping from filter center frequency to the output instantaneous frequency. This procedure can replace the periodicity detection and aperiodicity estimation subsystems of recently introduced open source vocoder, YANG vocoder. Source code of MATLAB implementation of this method will also be open sourced.

Original languageEnglish
Pages (from-to)424-428
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2017-August
DOIs
Publication statusPublished - 2017
Event18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 - Stockholm, Sweden
Duration: 20 Aug 201724 Aug 2017

Keywords

  • Aperiodicity
  • Fundamental frequency
  • Group delay
  • Instantaneous frequency
  • Speech analysis

Fingerprint

Dive into the research topics of 'A modulation property of time-frequency derivatives of filtered phase and its application to aperiodicity and fo estimation'. Together they form a unique fingerprint.

Cite this