Improved feature extraction for CRNN-based multiple sound source localization - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Improved feature extraction for CRNN-based multiple sound source localization

Résumé

In this work, we propose to extend a state-of-the-art multi-source localization system based on a convolutional recurrent neural network and Ambisonics signals. We significantly improve the performance of the baseline network by changing the layout between convolutional and pooling layers. We propose several configurations with more convolutional layers and smaller pooling sizes in-between, so that less information is lost across the layers, leading to a better feature extraction. In parallel, we test the system's ability to localize up to 3 sources, in which case the improved feature extraction provides the most significant boost in accuracy. We evaluate and compare these improved configurations on synthetic and real-world data. The obtained results show a quite substantial improvement of the multiple sound source localization performance over the baseline network.
Fichier principal
Vignette du fichier
eusipco2021.pdf (331.64 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03537334 , version 1 (20-01-2022)

Identifiants

Citer

Pierre-Amaury Grumiaux, Srdan Kitić, Laurent Girin, Alexandre Guérin. Improved feature extraction for CRNN-based multiple sound source localization. EUSIPCO 2021 - 29th European Signal Processing Conference (EUSIPCO), Aug 2021, Dublin, Ireland. pp.231-235, ⟨10.23919/EUSIPCO54536.2021.9616124⟩. ⟨hal-03537334⟩
25 Consultations
85 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More