Automatic identification of vowels in the Cued Speech context

Noureddine Aboutabit; Denis Beautemps; Laurent Besacier

Communication Dans Un Congrès Année : 2007

Automatic identification of vowels in the Cued Speech context

(1) , (1) , (2, 3)

1
2
3

Noureddine Aboutabit

Fonction : Auteur

GIPSA - Machines Parlantes, Agents Communicants & Interaction Face-à-face

Denis Beautemps

Fonction : Auteur
PersonId : 18206
IdHAL : denis-beautemps
ORCID : 0000-0001-9625-3018
IdRef : 099427524

GIPSA - Machines Parlantes, Agents Communicants & Interaction Face-à-face

Laurent Besacier

Fonction : Auteur
PersonId : 1521
IdHAL : laurent-besacier
ORCID : 0000-0001-7411-9125
IdRef : 079377017

Communication Langagière et Interaction Personne-Système

Laboratoire d'Informatique de Grenoble

Résumé

The phonetic translation of Cued Speech (CS) (Cornett [1]) gestures needs to mix the manual CS information together with the lips, taking into account the desynchronization delay (Attina et al. [2], Aboutabit et al. [3]) between these two flows of information. The automatic coding of CS hand positions and lip targets (Aboutabit et al. [3], Aboutabit et al. [4]) are thus a key factor in the mixing process. This contribution focuses on the identification of vowels by merging CS hand positions and vocalic lip information produced by a CS speaker. The hand flow is coded automatically as plateaus between transition phases. A plateau is defined as the interval during which the hand is maintained at a specific CS hand position. A transition is the interval during which the hand moves from a specific CS hand position to another one. The CS hand position is automatically obtained as the result of the hand 2d coordinates Gaussian classification. The instants of reached hand targets are used as reference instants to define the interval inside which the lip target instant of the vowel is automatically detected. The lip parameters extracted at this instant are processed in a Gaussian classifier as to identify the vocalic lip feature of the vowel. The vowel is obtained as the result of the combination of the corresponding hand position and the lip feature. The global performance of the method attains 77.6% as correct identification score. This result does not take into account the CS coding errors. This result has to be compared with the global 83.5% score of speech reception by deaf people using CS (Nichols and Ling, 1982 [6].

Mots clés

lip target segmentation and CS gesture segmentation. Cued Speech production and CS gesture segmentation vocalic lip classification

Domaines

Sciences de l'information et de la communication

Fichier principal

Aboutabit_AVSP_publie.pdf (215.57 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Denis Beautemps : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00266694

Soumis le : mardi 25 mars 2008-11:44:48

Dernière modification le : jeudi 4 avril 2024-21:29:17

Archivage à long terme le : vendredi 21 mai 2010-00:51:25

Dates et versions

hal-00266694 , version 1 (25-03-2008)

Identifiants

HAL Id : hal-00266694 , version 1

Citer

Noureddine Aboutabit, Denis Beautemps, Laurent Besacier. Automatic identification of vowels in the Cued Speech context. AVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.8. ⟨hal-00266694⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA IMAG CNRS GIPSA GIPSA-DPC LIG GIPSA-MPACIF LIG_TDCGE LIG_TDCGE_GETALP POLYTECH-GRENOBLE LIG_SIDCH

246 Consultations

573 Téléchargements

Automatic identification of vowels in the Cued Speech context

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager