A Source Localization/Separation/Respatialization System Based on Unsupervised Classification of Interaural Cues - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

A Source Localization/Separation/Respatialization System Based on Unsupervised Classification of Interaural Cues

Résumé

In this paper we propose a complete computational system for Auditory Scene Analysis. This time-frequency system localizes, separates, and spatializes an arbitrary number of audio sources given only binaural signals. The localization is based on recent research frameworks, where interaural level and time differences are combined to derive a confident direction of arrival (azimuth) at each frequency bin. Here, the power-weighted histogram constructed in the azimuth space is modeled as a Gaussian Mixture Model, whose parameter structure is revealed through a weighted Expectation Maximization. Afterwards, a bank of Gaussian spatial filters is configured automatically to extract the sources with significant energy accordingly to a posterior probability. In this frequency-domain framework, we also inverse a geometrical and physical head model to derive an algorithm that simulates a source as originating from any azimuth angle.

Domaines

Autre [cs.OH]
Fichier principal
Vignette du fichier
dafx0_p_.pdf (311.01 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00307889 , version 1 (29-07-2008)

Identifiants

  • HAL Id : hal-00307889 , version 1

Citer

Joan Mouba, Sylvain Marchand. A Source Localization/Separation/Respatialization System Based on Unsupervised Classification of Interaural Cues. Proceedings of the Digital Audio Effects (DAFx06) Conference, Sep 2006, Canada. pp.233--238. ⟨hal-00307889⟩

Collections

CNRS
291 Consultations
130 Téléchargements

Partager

Gmail Facebook X LinkedIn More