4423 articles – 2353 Notices  [english version]
HAL : hal-00664090, version 1

Voir la fiche concise  BibTeX,EndNote,...
Advances in audio source separation and multisource audio content retrieval
Vincent E.
SPIE Defense, Security, and Sensing, Baltimore : États-Unis (2012) - http://hal.inria.fr/hal-00664090
Conférences invitées
Informatique/Traitement du signal et de l'image
Sciences de l'ingénieur/Traitement du signal et de l'image
Advances in audio source separation and multisource audio content retrieval
Emmanuel Vincent () 1
1 :  METISS (INRIA - IRISA)
http://www.inria.fr/equipes/metiss
CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1
Campus de Beaulieu 35042 Rennes cedex
France
Audio source separation aims to extract the signals of individual sound sources from a given recording. In this paper, we review three recent advances which improve the robustness of source separation in real-world challenging scenarios and enable its use for multisource content retrieval tasks, such as automatic speech recognition (ASR) or acoustic event detection (AED) in noisy environments. We present a Flexible Audio Source Separation Toolkit (FASST) and discuss its advantages compared to earlier approaches such as independent component analysis (ICA) and sparse component analysis (SCA). We explain how cues as diverse as harmonicity, spectral envelope, temporal fine structure or spatial location can be jointly exploited by this toolkit. We subsequently present the uncertainty decoding (UD) framework for the integration of audio source separation and audio content retrieval. We show how the uncertainty about the separated source signals can be accurately estimated and propagated to the features. Finally, we explain how this uncertainty can be efficiently exploited by a classifier, both at the training and the decoding stage. We illustrate the resulting performance improvements in terms of speech separation quality and speaker recognition accuracy.
Anglais

25/04/2012
internationale
SPIE Defense, Security, and Sensing
Baltimore
États-Unis
23/04/2012
27/01/2012

Liste des fichiers attachés à ce document :
PDF
vincent_DSS12.pdf(144 KB)