Conference paper, Year: 2018

Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking

Abstract

Multiple-speaker tracking is a crucial task for many applications. In real-world scenarios, exploiting the complementarity between auditory and visual data makes it possible to track people outside the visual field of view. However, practical methods must be robust to changes in acoustic conditions, e.g. reverberation. We investigate how to combine state-of-the-art audio-source localization techniques with Bayesian multi-person tracking. Our experiments demonstrate that the performance of the proposed system is not affected by changes in the acoustic environment.
Main file
Ban-ICASSP18.pdf (2.28 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-01718114, version 1 (27-02-2018)

Cite

Yutong Ban, Xiaofei Li, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking. ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. pp.6553-6557, ⟨10.1109/ICASSP.2018.8462100⟩. ⟨hal-01718114⟩