Speaker adaptation of an acoustic-to-articulatory inversion model using cascaded Gaussian mixture regressions - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Speaker adaptation of an acoustic-to-articulatory inversion model using cascaded Gaussian mixture regressions

Résumé

The article presents a method for adapting a GMM-based acoustic-articulatory inversion model trained on a reference speaker to another speaker. The goal is to estimate the articulatory trajectories in the geometrical space of a reference speaker from the speech audio signal of another speaker. This method is developed in the context of a system of visual biofeedback, aimed at pronunciation training. This system provides a speaker with visual information about his/her own articulation, via a 3D orofacial clone. In previous work, we proposed to use GMM-based voice conversion for speaker adaptation. Acoustic-articulatory mapping was achieved in 2 consecutive steps: 1) converting the spectral trajectories of the target speaker (i.e. the system user) into spectral trajectories of the reference speaker (voice conversion), and 2) estimating the most likely articulatory trajectories of the reference speaker from the converted spectral features (acoustic-articulatory inversion). In this work, we propose to combine these two steps into the same statistical mapping framework, by fusing multiple regressions based on trajectory GMM and maximum likelihood criterion (MLE). The proposed technique is compared to two standard speaker adaptation techniques based respectively on MAP and MLLR.
Fichier principal
Vignette du fichier
th_IS13.pdf (1.14 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00851894 , version 1 (19-08-2013)

Identifiants

  • HAL Id : hal-00851894 , version 1

Citer

Thomas Hueber, Gérard Bailly, Pierre Badin, Frédéric Elisei. Speaker adaptation of an acoustic-to-articulatory inversion model using cascaded Gaussian mixture regressions. Interspeech 2013 - 14th Annual Conference of the International Speech Communication Association, Aug 2013, Lyon, France. pp.2753-2757. ⟨hal-00851894⟩
251 Consultations
274 Téléchargements

Partager

Gmail Facebook X LinkedIn More