Visual Contribution to Speech Perception: Measuring the Intelligibility of Animated Talking Heads

Slim Ouni 1, Michael Cohen 2, Hope Ishak 2, Dominic Massaro 2
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract: Animated agents are becoming increasingly frequent in research and applications in speech science. An important challenge is to evaluate the effectiveness of the agent in terms of the intelligibility of its visible speech. In three experiments, we extend and test the Sumby and Pollack (1954) metric to allow the comparison of an agent relative to a standard or reference, and also propose a new metric based on the fuzzy logical model of perception (FLMP) to describe the benefit provided by a synthetic animated face relative to the benefit provided by a natural face. A valid metric would allow direct comparisons across different experiments and would give measures of the benefit of a synthetic animated face relative to a natural face (or indeed any two conditions) and how this benefit varies as a function of the type of synthetic face, the test items (e.g., syllables versus sentences), different individuals, and applications.
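As an illustration of the kind of metric the abstract refers to, the sketch below computes the Sumby and Pollack (1954) visual contribution, (AV - A) / (1 - A), where A is the proportion correct with audio alone and AV the proportion correct with audio plus a face. The comparison of a synthetic face against a natural one is written here as a simple ratio of the two contributions; that ratio, the function names, and the example scores are assumptions made for illustration, and the paper's own definitions may differ.

```python
def visual_contribution(av: float, a: float) -> float:
    """Sumby and Pollack (1954) style visual contribution.

    av: proportion correct in the bimodal (audio + face) condition.
    a:  proportion correct in the audio-alone condition.
    Returns the gain from the face as a fraction of the room left for
    improvement over audio alone.
    """
    if not 0.0 <= av <= 1.0:
        raise ValueError("av must be a proportion in [0, 1]")
    if not 0.0 <= a < 1.0:
        raise ValueError("a must be a proportion in [0, 1); at a = 1 there is nothing left to gain")
    return (av - a) / (1.0 - a)


def relative_visual_contribution(av_synthetic: float, av_natural: float, a: float) -> float:
    """Hypothetical comparison: contribution of a synthetic face divided by
    that of a natural face, both measured against the same audio-alone score.
    This ratio is an illustrative assumption, not a quoted definition."""
    natural = visual_contribution(av_natural, a)
    if natural == 0.0:
        raise ValueError("the natural face gives no benefit; the ratio is undefined")
    return visual_contribution(av_synthetic, a) / natural


# Made-up scores for illustration only (not data from the paper).
audio_alone = 0.40
with_natural_face = 0.80
with_synthetic_face = 0.70

natural_gain = visual_contribution(with_natural_face, audio_alone)
synthetic_gain = visual_contribution(with_synthetic_face, audio_alone)
ratio = relative_visual_contribution(with_synthetic_face, with_natural_face, audio_alone)
print(natural_gain, synthetic_gain, ratio)
```

With a shared audio-alone score the (1 - A) terms cancel, so the ratio reduces to (AV_synthetic - A) / (AV_natural - A). A value near 1 would mean the synthetic face helps about as much as the natural one under this assumed comparison.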

https://hal.archives-ouvertes.fr/hal-00184425
Contributor: Slim Ouni
Submitted on: Wednesday, 31 October 2007 - 09:47:42
Last modified on: Thursday, 11 January 2018 - 06:19:56

Citation

Slim Ouni, Michael Cohen, Hope Ishak, Dominic Massaro. Visual Contribution to Speech Perception: Measuring the Intelligibility of Animated Talking Heads. EURASIP Journal on Audio, Speech, and Music Processing, SpringerOpen, 2007, 2007, Article ID 47891. doi:10.1155/2007/47891. hal-00184425.
