A Source Localization/Separation/Respatialization System Based on Unsupervised Classification of Interaural Cues

Joan Mouba; Sylvain Marchand

Communication Dans Un Congrès Année : 2006

A Source Localization/Separation/Respatialization System Based on Unsupervised Classification of Interaural Cues

(1) , (1)

Joan Mouba

Fonction : Auteur

Laboratoire Bordelais de Recherche en Informatique

Sylvain Marchand

Fonction : Auteur

Laboratoire Bordelais de Recherche en Informatique

Résumé

In this paper we propose a complete computational system for Auditory Scene Analysis. This time-frequency system localizes, separates, and spatializes an arbitrary number of audio sources given only binaural signals. The localization is based on recent research frameworks, where interaural level and time differences are combined to derive a confident direction of arrival (azimuth) at each frequency bin. Here, the power-weighted histogram constructed in the azimuth space is modeled as a Gaussian Mixture Model, whose parameter structure is revealed through a weighted Expectation Maximization. Afterwards, a bank of Gaussian spatial filters is configured automatically to extract the sources with significant energy accordingly to a posterior probability. In this frequency-domain framework, we also inverse a geometrical and physical head model to derive an algorithm that simulates a source as originating from any azimuth angle.

Domaines

Autre [cs.OH]

Fichier principal

dafx0_p_.pdf (311.01 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Joan Mouba : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00307889

Soumis le : mardi 29 juillet 2008-15:10:44

Dernière modification le : vendredi 24 mars 2023-14:52:50

Archivage à long terme le : vendredi 5 octobre 2012-11:28:09

Dates et versions

hal-00307889 , version 1 (29-07-2008)

Identifiants

HAL Id : hal-00307889 , version 1

Citer

Joan Mouba, Sylvain Marchand. A Source Localization/Separation/Respatialization System Based on Unsupervised Classification of Interaural Cues. Proceedings of the Digital Audio Effects (DAFx06) Conference, Sep 2006, Canada. pp.233--238. ⟨hal-00307889⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS

291 Consultations

130 Téléchargements

A Source Localization/Separation/Respatialization System Based on Unsupervised Classification of Interaural Cues

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager