| HAL : hal-00629263, version 1 |
| Fiche détaillée | Récupérer au format |
|
|
| 19th European Signal Processing Conference (EUSIPCO 2011), Barcelonne : Spain (2011) |
|
|
|
|
| Sound effect on visual gaze when looking at videos |
|
|
| Guanghan Song 1Denis Pellerin 1 |
|
|
| LIMA Région Rhône-Alpes Collaboration(s) |
|
|
| (2011) |
|
|
| This paper presents an analysis of sound effect on visual gaze when looking at videos to help to predict eye positions. First, an audio-visual experiment was designed with two groups of participants, with audio-visual (AV) and visual (V) conditions to test the sound effect. We classify the sound in three classes: on-screen speech, non-speech and non-sound. We observe with statistical methods that the sound effect is different depending on the class of sound. Then a comparison of the experimental data and a visual saliency model was carried out, which proves that adding sound to video decreases the accuracy of the prediction of the visual saliency model without a sound pathway. Finally, the result of locating the coordinates of a sound source manually provides a viable aspect of sound pathway for future work. |
|
|
|
|
|
|
|
|
|
|
| 1 : | Grenoble Images Parole Signal Automatique (GIPSA-lab) |
| CNRS : UMR5216 – Université Joseph Fourier - Grenoble I – Université Pierre-Mendès-France - Grenoble II – Université Stendhal - Grenoble III – Institut Polytechnique de Grenoble - Grenoble Institute of Technology | |
|
|
|
|
|
|
|
|
| Domaine | : | Informatique/Traitement des images |
|
|
| Sound – Visual saliency – Saliency model – Gaze prediction – Video |
|
|
| Liste des fichiers attachés à ce document : | |||||
|
|
|
| hal-00629263, version 1 | |
| http://hal.archives-ouvertes.fr/hal-00629263 | |
| oai:hal.archives-ouvertes.fr:hal-00629263 | |
| Contributeur : Guanghan Song | |
| Soumis le : Mercredi 5 Octobre 2011, 14:38:35 | |
| Dernière modification le : Vendredi 7 Octobre 2011, 16:24:47 | |