Mapping de l'espace spectral vers l'espace visuel de la parole: Les voyelles du Français en Langue Française Parlée Complétée

Zuheng Ming 1 Gang Feng 2 Denis Beautemps 2
2 GIPSA-MAGIC - MAGIC
GIPSA-DPC - Département Parole et Cognition
Abstract : In this paper, we present a statistical method based on GMM modeling to map the acoustic speech spectral features to visual features of Cued Speech in the sense of least square error in a low signal level which is innovative and different with the classic text-to-visual approach. In comparison with the GMM based mapping modeling we first present the results with the use of a multi-linear model also at the low signal level and study the limitation of the approach. The experimental results demonstrate that the GMM based mapping method can significant improve the mapping performance compared with the multi-linear based mapping model especial in the sense of the weak linear correlation between the target and the predictor such as the hand positions of Cued Speech and the acoustic speech spectral features.
Document type :
Conference papers
Complete list of metadatas

Cited literature [5 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00700406
Contributor : Denis Beautemps <>
Submitted on : Tuesday, May 22, 2012 - 6:55:30 PM
Last modification on : Monday, April 9, 2018 - 12:22:33 PM
Long-term archiving on : Thursday, August 23, 2012 - 2:41:16 AM

File

Ming_Jep2012_Revised_v2.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00700406, version 1

Citation

Zuheng Ming, Gang Feng, Denis Beautemps. Mapping de l'espace spectral vers l'espace visuel de la parole: Les voyelles du Français en Langue Française Parlée Complétée. 14ème édition des Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (JEP-TALN-RECITAL'2012), Jun 2012, Grenoble, France. pp.73-80. ⟨hal-00700406⟩

Share

Metrics

Record views

283

Files downloads

999