Audio Visual Integration with Competing Sources in the Framework of Audio Visual Speech Scene Analysis

Attigodu Chandrashekara Ganesh; Frédéric Berthommier; Jean-Luc Schwartz

doi:10.1007/978-3-319-25474-6_42

Book chapter, Year: 2016


Attigodu Chandrashekara Ganesh

Role: Corresponding author

GIPSA - Perception, Contrôle, Multimodalité et Dynamiques de la parole

Frédéric Berthommier

Role: Author
PersonId : 883073

GIPSA - Perception, Contrôle, Multimodalité et Dynamiques de la parole

Jean-Luc Schwartz

Role: Author
PersonId : 1160
IdHAL : jean-luc-schwartz
ORCID : 0000-0001-8969-9185
IdRef : 033230374

GIPSA - Perception, Contrôle, Multimodalité et Dynamiques de la parole

Abstract

We introduce “Audio-Visual Speech Scene Analysis” (AVSSA) as an extension of the two-stage Auditory Scene Analysis model towards audiovisual scenes made of mixtures of speakers. AVSSA assumes that a coherence index between the auditory and the visual input is computed prior to audiovisual fusion, making it possible to determine whether the sensory inputs should be bound together. Previous experiments on the modulation of the McGurk effect by audiovisual coherent vs. incoherent contexts presented before the McGurk target have provided experimental evidence supporting AVSSA. Indeed, incoherent contexts appear to decrease the McGurk effect, suggesting that they produce lower audiovisual coherence and hence less audiovisual fusion. The present experiments extend the AVSSA paradigm by creating contexts made of competing audiovisual sources and measuring their effect on McGurk targets. The competing audiovisual sources have respectively a high and a low audiovisual coherence (that is, large vs. small audiovisual comodulations in time). The first experiment involves contexts made of two auditory sources and one video source associated with either the first or the second audio source. It appears that the McGurk effect is smaller after the context in which the visual source is associated with the auditory source that has less audiovisual coherence. In the second experiment, with the same stimuli, the participants are asked to attend to either one or the other source. The data show that the modulation of fusion depends on the attentional focus. Altogether, these two experiments shed light on audiovisual binding, the AVSSA process and the role of attention.

Keywords

Audio visual binding; Auditory speech analysis; McGurk effect; Attention

Domains

Cognitive sciences

Main file

Ganesh-et-al-2016.pdf (1.02 MB)

Origin: Publisher files allowed on an open archive

Contributor: Frédéric Berthommier

https://hal.science/hal-01421589

Submitted on: Thursday, December 22, 2016, 15:43:15

Last modified on: Thursday, April 4, 2024, 20:58:30

Long-term archiving on: Tuesday, March 21, 2017, 01:26:16

Dates and versions

hal-01421589 , version 1 (22-12-2016)

Identifiers

HAL Id : hal-01421589 , version 1
DOI : 10.1007/978-3-319-25474-6_42

Cite

Attigodu Chandrashekara Ganesh, Frédéric Berthommier, Jean-Luc Schwartz. Audio Visual Integration with Competing Sources in the Framework of Audio Visual Speech Scene Analysis. van Dijk P., Başkent D., Gaudrain E., de Kleine E., Wagner A., Lanting C. Physiology, Psychoacoustics and Cognition in Normal and Impaired Hearing, 894, Springer, pp.399-408, 2016, Advances in Experimental Medicine and Biology, ⟨10.1007/978-3-319-25474-6_42⟩. ⟨hal-01421589⟩


Collections

UGA CNRS GIPSA GIPSA-DPC GIPSA-PCMD

331 views

81 downloads
