Can tongue be recovered from face? The answer of data-driven statistical models - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Can tongue be recovered from face? The answer of data-driven statistical models

Résumé

This study revisits the face-to-tongue articulatory inversion problem in speech. We compare the Multi Linear Regression method (MLR) with two more sophisticated methods based on Hidden Markov Models (HMMs) and Gaussian Mixture Models (GMMs), using the same French corpus of articulatory data acquired by ElectroMagnetoGraphy. GMMs give overall results better than HMMs, but MLR does poorly. GMMs and HMMs maintain the original phonetic class distribution, though with some centralisation effects, effects still much stronger with MLR. A detailed analysis shows that, if the jaw / lips / tongue tip synergy helps recovering front high vowels and coronal consonants, the velars are not recovered at all. It is therefore not possible to recover reliably tongue from face.
Fichier non déposé

Dates et versions

hal-00508276 , version 1 (02-08-2010)

Identifiants

  • HAL Id : hal-00508276 , version 1

Citer

Atef Ben Youssef, Pierre Badin, Gérard Bailly. Can tongue be recovered from face? The answer of data-driven statistical models. Interspeech 2010 - 11th Annual Conference of the International Speech Communication Association, Sep 2010, Makuhari, Japan. pp.2002-2005. ⟨hal-00508276⟩
105 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More