Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking

Yutong Ban 1, Xiaofei Li 1, Xavier Alameda-Pineda 1, Laurent Girin 2, 1, Radu Horaud 1
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
2 GIPSA-CRISSP - CRISSP
GIPSA-DPC - Département Parole et Cognition
Abstract: Multiple-speaker tracking is a crucial task for many applications. In real-world scenarios, exploiting the complementarity between auditory and visual data makes it possible to track people outside the visual field of view. However, practical methods must be robust to changes in acoustic conditions, e.g. reverberation. We investigate how to combine state-of-the-art audio-source localization techniques with Bayesian multi-person tracking. Our experiments demonstrate that the performance of the proposed system is not affected by changes in the acoustic environment.
Document type:
Conference paper
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Apr 2018, Calgary, Alberta, Canada. ICASSP 2018 - Proceedings

Cited literature: 20 references

https://hal.inria.fr/hal-01718114
Contributor: Team Perception
Submitted on: Tuesday, 27 February 2018 - 10:34:57
Last modified on: Friday, 27 July 2018 - 11:15:42
Document(s) archived on: Monday, 28 May 2018 - 17:05:09

File

Ban-ICASSP18.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01718114, version 1

Citation

Yutong Ban, Xiaofei Li, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Apr 2018, Calgary, Alberta, Canada. ICASSP 2018 - Proceedings. 〈hal-01718114〉
