Amélioration de la conversion de voix chuchotée enregistrée par capteur NAM vers la voix audible (Improving the conversion of whispered speech recorded with a NAM sensor into audible speech)
Conference paper, Year: 2008


Abstract

The NAM-to-speech conversion proposed by Toda and colleagues, which converts Non-Audible Murmur (NAM) to audible speech through a statistical mapping trained on aligned corpora, is a very promising technique, but its performance is still insufficient. In this paper, we present our current work on improving the intelligibility and naturalness of the speech synthesized from whispered speech with this technique. The first system aims to improve F0 estimation and the voicing decision: a simple neural network detects voiced segments in the whisper, while a GMM, trained on voiced segments, estimates a continuous melodic contour. In the second system, we attempt to integrate visual information to improve spectral estimation, F0 estimation, and the voicing decision.
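The statistical mapping mentioned in the abstract can be illustrated with a minimal sketch of GMM-based regression, in which an output value is estimated as the conditional mean E[y | x] under a joint Gaussian mixture over input and output. This is an assumption-laden illustration and not the authors' implementation: the one-dimensional input feature, the component parameters, and the function names `gmm_conditional_mean` and `convert_frames` are all hypothetical, and the voicing flags are simply passed in where the paper uses a neural network to produce them.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def gmm_conditional_mean(x, components):
    """Estimate E[y | x] under a joint GMM over the pair (x, y).

    Each component is a dict with the (hypothetical) keys
    weight, mean_x, mean_y, var_x and cov_xy.
    """
    # Weighted likelihood of x under each component's marginal over x.
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mean_x"], c["var_x"]) for c in components
    ]
    total = sum(likelihoods)
    if total == 0.0:
        raise ValueError("x is too far from every component (numerical underflow)")

    estimate = 0.0
    for likelihood, c in zip(likelihoods, components):
        posterior = likelihood / total  # P(component | x)
        # Conditional mean of y given x within this component.
        conditional = c["mean_y"] + c["cov_xy"] / c["var_x"] * (x - c["mean_x"])
        estimate += posterior * conditional
    return estimate


def convert_frames(features, voiced_flags, components):
    """Map each voiced frame to an F0 estimate; unvoiced frames get 0.0.

    voiced_flags is assumed to come from a separate voicing detector.
    """
    return [
        gmm_conditional_mean(x, components) if voiced else 0.0
        for x, voiced in zip(features, voiced_flags)
    ]


# Toy parameters chosen only for illustration (not taken from the paper).
toy_components = [
    {"weight": 0.5, "mean_x": -1.0, "mean_y": 100.0, "var_x": 1.0, "cov_xy": 0.0},
    {"weight": 0.5, "mean_x": 1.0, "mean_y": 200.0, "var_x": 1.0, "cov_xy": 0.0},
]
f0_track = convert_frames([0.0, 0.0], [True, False], toy_components)
```

In a realistic setting the input would be a multi-dimensional spectral feature vector and the mixture parameters would be learned from aligned whisper and speech recordings; the scalar case is used here only to keep the conditional-mean computation easy to follow.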
Main file
vat_JEP08.pdf (771.71 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00339058, version 1 (15-11-2008)

Identifiers

  • HAL Id: hal-00339058, version 1

Cite

Viet-Anh Tran, Gérard Bailly, Hélène Loevenbruck, Christian Jutten. Amélioration de la conversion de voix chuchotée enregistrée par capteur NAM vers la voix audible. JEP 2008 - 27e Journées d'Etudes sur la Parole, Jun 2008, Avignon, France. pp.110-113. ⟨hal-00339058⟩
