Improved feature extraction for CRNN-based multiple sound source localization

Pierre-Amaury Grumiaux; Srdan Kitić; Laurent Girin; Alexandre Guérin

doi:10.23919/EUSIPCO54536.2021.9616124

Communication Dans Un Congrès Année : 2021

Improved feature extraction for CRNN-based multiple sound source localization

(1) , (1) , (2) , (1)

1
2

Pierre-Amaury Grumiaux

Fonction : Auteur
PersonId : 737841
IdHAL : pierre-amaury-grumiaux
ORCID : 0000-0001-5263-787X
IdRef : 253134757

Orange Labs [Cesson-Sévigné]

Srdan Kitić

Fonction : Auteur
PersonId : 1123717

Orange Labs [Cesson-Sévigné]

Laurent Girin

Fonction : Auteur
PersonId : 3682
IdHAL : laurent-girin
ORCID : 0000-0002-9214-8760
IdRef : 088998037

GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing

Alexandre Guérin

Fonction : Auteur
PersonId : 1036640

Orange Labs [Cesson-Sévigné]

Résumé

In this work, we propose to extend a state-of-the-art multi-source localization system based on a convolutional recurrent neural network and Ambisonics signals. We significantly improve the performance of the baseline network by changing the layout between convolutional and pooling layers. We propose several configurations with more convolutional layers and smaller pooling sizes in-between, so that less information is lost across the layers, leading to a better feature extraction. In parallel, we test the system's ability to localize up to 3 sources, in which case the improved feature extraction provides the most significant boost in accuracy. We evaluate and compare these improved configurations on synthetic and real-world data. The obtained results show a quite substantial improvement of the multiple sound source localization performance over the baseline network.

Mots clés

sound source localization convolutional recurrent neural network ambisonics reverberation

Domaines

Son [cs.SD] Intelligence artificielle [cs.AI] Réseau de neurones [cs.NE]

Fichier principal

eusipco2021.pdf (331.64 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Pierre-Amaury Grumiaux : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03537334

Soumis le : jeudi 20 janvier 2022-12:58:51

Dernière modification le : jeudi 4 avril 2024-20:49:41

Archivage à long terme le : jeudi 21 avril 2022-18:55:49

Dates et versions

hal-03537334 , version 1 (20-01-2022)

Identifiants

HAL Id : hal-03537334 , version 1
DOI : 10.23919/EUSIPCO54536.2021.9616124

Citer

Pierre-Amaury Grumiaux, Srdan Kitić, Laurent Girin, Alexandre Guérin. Improved feature extraction for CRNN-based multiple sound source localization. EUSIPCO 2021 - 29th European Signal Processing Conference (EUSIPCO), Aug 2021, Dublin, Ireland. pp.231-235, ⟨10.23919/EUSIPCO54536.2021.9616124⟩. ⟨hal-03537334⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS GIPSA GIPSA-CRISSP GIPSA-PPC

25 Consultations

85 Téléchargements

Improved feature extraction for CRNN-based multiple sound source localization

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager