A Multilinear Tongue Model Derived from Speech Related MRI Data of the Human Vocal Tract

Abstract : We present a multilinear statistical model of the human tongue that captures anatomical and tongue pose related shape variations separately. The model was derived from 3D magnetic resonance imaging data of 11 speakers sustaining speech related vocal tract configurations. The extraction was performed by using a minimally supervised method that uses as basis an image segmentation approach and a template fitting technique. Furthermore, it uses image denoising to deal with possibly corrupt data, palate surface information reconstruction to handle palatal tongue contacts, and a bootstrap strategy to refine the obtained shapes. Our experiments concluded that limiting the degrees of freedom for the anatomical and speech related variations to 5 and 4 respectively produces a model that can reliably register unknown data while avoiding overfitting effects.
Type de document :
Pré-publication, Document de travail
2016
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01418460
Contributeur : Alexander Hewer <>
Soumis le : vendredi 16 décembre 2016 - 17:18:30
Dernière modification le : jeudi 23 février 2017 - 22:17:11
Document(s) archivé(s) le : mardi 21 mars 2017 - 11:38:22

Fichiers

article.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01418460, version 1
  • ARXIV : 1612.05005

Collections

Citation

Alexander Hewer, Stefanie Wuhrer, Ingmar Steiner, Korin Richmond. A Multilinear Tongue Model Derived from Speech Related MRI Data of the Human Vocal Tract. 2016. <hal-01418460>

Partager

Métriques

Consultations de
la notice

169

Téléchargements du document

207