An audiovisual talking head for augmented speech generation: models and animations based on a real speaker's articulatory data - Archive ouverte HAL
Book chapter, 2008

An audiovisual talking head for augmented speech generation: models and animations based on a real speaker's articulatory data

Abstract

We present a methodology developed to derive three-dimensional models of speech articulators from volume MRI and multiple-view video images acquired from one speaker. Linear component analysis is used to model these highly deformable articulators as the weighted sum of a small number of basic shapes corresponding to the articulators' degrees of freedom for speech. These models are assembled into an audiovisual talking head that can produce augmented audiovisual speech, i.e. it can display usually non-visible articulators such as the tongue or the velum. The talking head is then animated by recovering its control parameters by inversion from the coordinates of a small number of points on the articulators of the same speaker, tracked by Electro-Magnetic Articulography. The augmented speech produced points the way to promising applications in the domains of speech therapy for speech-retarded children, perception and production rehabilitation for hearing-impaired children, and pronunciation training for second-language learners.
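The two core operations described in the abstract, a linear shape model (shapes as a weighted sum of basic shapes) and the recovery of control parameters from a few tracked flesh points, can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the dimensions, the random stand-in data, and the function names (`synthesize`, `invert`, `coil_rows`) are all hypothetical; in the paper the mean shape and basis come from MRI and video data, and the sparse points come from EMA coils.

```python
import numpy as np

# Hypothetical dimensions: a mesh of 100 3-D vertices (300 coordinates),
# controlled by 6 articulatory degrees of freedom.
n_coords, n_dof = 300, 6
rng = np.random.default_rng(0)

# Random stand-ins for quantities the paper derives from MRI/video data:
mean_shape = rng.normal(size=n_coords)        # neutral articulator shape
basis = rng.normal(size=(n_coords, n_dof))    # basic shapes (linear components)

# Forward model: shape = mean + weighted sum of basic shapes.
def synthesize(weights):
    return mean_shape + basis @ weights

# EMA tracks only a few flesh points: here we pretend 4 sensor coils
# correspond to the first 12 coordinates of the model.
coil_rows = np.arange(12)

def invert(ema_coords):
    """Recover control parameters from sparse coil coordinates
    by linear least squares on the reduced model."""
    w, *_ = np.linalg.lstsq(basis[coil_rows],
                            ema_coords - mean_shape[coil_rows], rcond=None)
    return w

# Round trip: known weights -> coil positions -> recovered weights.
true_w = rng.normal(size=n_dof)
recovered = invert(synthesize(true_w)[coil_rows])
print(np.allclose(recovered, true_w))
```

Because 12 observed coordinates over-determine 6 parameters, the least-squares inversion is well posed here; with noisy real EMA data the recovered weights would only approximate the true articulation.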

Dates and versions

hal-00296599 , version 1 (13-07-2008)

Identifiers

Cite

Pierre Badin, Frédéric Elisei, Gérard Bailly, Yuliya Tarabalka. An audiovisual talking head for augmented speech generation: models and animations based on a real speaker's articulatory data. In F.J. Perales and R.B. Fisher (eds.), Proceedings of the Vth Conference on Articulated Motion and Deformable Objects (AMDO 2008), Lecture Notes in Computer Science, vol. 5098, Springer: Berlin, Heidelberg, Germany, pp. 132-143, 2008. ⟨10.1007/978-3-540-70517-8_14⟩. ⟨hal-00296599⟩