A Multilinear Tongue Model Derived from Speech Related MRI Data of the Human Vocal Tract

We present a multilinear statistical model of the human tongue that captures anatomical and tongue pose related shape variations separately. The model was derived from 3D magnetic resonance imaging data of 11 speakers sustaining speech related vocal tract configurations. The extraction was performed by using a minimally supervised method that uses as basis an image segmentation approach and a template fitting technique. Furthermore, it uses image denoising to deal with possibly corrupt data, palate surface information reconstruction to handle palatal tongue contacts, and a bootstrap strategy to refine the obtained shapes. Our experiments concluded that limiting the degrees of freedom for the anatomical and speech related variations to 5 and 4 respectively produces a model that can reliably register unknown data while avoiding overfitting effects.

Mots clés

tongue vocal tract MRI statistical model shape analysis

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

article.pdf (9.34 Mo)

compactness_phoneme.pdf (4.79 Ko)

compactness_speaker.pdf (4.7 Ko)

fixed_phone_specificity.pdf (23.24 Ko)

generalization_phoneme.pdf (5.12 Ko)

generalization_speaker.pdf (5 Ko)

specificity_combined_phoneme.pdf (5.15 Ko)

specificity_combined_speaker.pdf (5.07 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Alexander Hewer : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01418460

Soumis le : vendredi 16 décembre 2016-17:18:30

Dernière modification le : mercredi 3 avril 2024-12:50:03

Archivage à long terme le : mardi 21 mars 2017-11:38:22

Dates et versions

hal-01418460 , version 1 (16-12-2016)

hal-01418460 , version 2 (14-04-2018)

Identifiants

HAL Id : hal-01418460 , version 1
ARXIV : 1612.05005

Citer

Alexander Hewer, Stefanie Wuhrer, Ingmar Steiner, Korin Richmond. A Multilinear Tongue Model Derived from Speech Related MRI Data of the Human Vocal Tract. 2016. ⟨hal-01418460v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

411 Consultations

460 Téléchargements