Automatic identification of vowels in the Cued Speech context

Abstract : The phonetic translation of Cued Speech (CS) (Cornett [1]) gestures needs to mix the manual CS information together with the lips, taking into account the desynchronization delay (Attina et al. [2], Aboutabit et al. [3]) between these two flows of information. The automatic coding of CS hand positions and lip targets (Aboutabit et al. [3], Aboutabit et al. [4]) are thus a key factor in the mixing process. This contribution focuses on the identification of vowels by merging CS hand positions and vocalic lip information produced by a CS speaker. The hand flow is coded automatically as plateaus between transition phases. A plateau is defined as the interval during which the hand is maintained at a specific CS hand position. A transition is the interval during which the hand moves from a specific CS hand position to another one. The CS hand position is automatically obtained as the result of the hand 2d coordinates Gaussian classification. The instants of reached hand targets are used as reference instants to define the interval inside which the lip target instant of the vowel is automatically detected. The lip parameters extracted at this instant are processed in a Gaussian classifier as to identify the vocalic lip feature of the vowel. The vowel is obtained as the result of the combination of the corresponding hand position and the lip feature. The global performance of the method attains 77.6% as correct identification score. This result does not take into account the CS coding errors. This result has to be compared with the global 83.5% score of speech reception by deaf people using CS (Nichols and Ling, 1982 [6].
Document type :
Conference papers
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00266694
Contributor : Denis Beautemps <>
Submitted on : Tuesday, March 25, 2008 - 11:44:48 AM
Last modification on : Tuesday, July 9, 2019 - 1:26:59 AM
Long-term archiving on : Friday, May 21, 2010 - 12:51:25 AM

File

Aboutabit_AVSP_publie.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00266694, version 1

Citation

Noureddine Aboutabit, Denis Beautemps, Laurent Besacier. Automatic identification of vowels in the Cued Speech context. International Conference on Auditory-Visual Speech Processing (AVSP 2007), Aug 2007, Hilvarenbeek, Netherlands. pp.8. ⟨hal-00266694⟩

Share

Metrics

Record views

442

Files downloads

809