"Sparsification" of audio signals using the MDCT/IntMDCT and a psychoacoustic model - Application to informed audio source separation

Jonathan Pinel; Laurent Girin

Communication Dans Un Congrès Année : 2011

"Sparsification" of audio signals using the MDCT/IntMDCT and a psychoacoustic model - Application to informed audio source separation

(1) , (2)

1
2

Jonathan Pinel

Fonction : Auteur
PersonId : 881863

GIPSA - Communication Information and Complex Systems

Laurent Girin

Fonction : Auteur
PersonId : 3682
IdHAL : laurent-girin
ORCID : 0000-0002-9214-8760
IdRef : 088998037

GIPSA - Machines parlantes, Gestes oro-faciaux, Interaction Face-à-face, Communication augmentée

Résumé

Sparse representations have proved a very useful tool in a variety of domain, e.g. speech/music source separation. As strictly sparse representations (in the sense of l0) are often impossible to achieve, other ways of studying signals sparsity have been proposed. In this paper, we revisit the irrelevance filtering analysis-synthesis approach proposed in (Balazs et al., IEEE Trans. ASLP, 18(1), 2010), where the TF coefficients that are below some masking threshold are set to zero. Instead of using the Gabor transform and a specific psychoacoustic model, we use tools directly inspired from perceptual audio coding, for instance MPEG-AAC. We show that significantly better "sparsification performances" are obtained on music signals, at lower computational cost. We then apply the sparsification process to the informed source separation (ISS) problem and show that it enables to significantly decrease the computational cost at the ISS decoder.

Mots clés

sparsification audio processing source separation

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

AES42_JP_LG.pdf (1.32 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Laurent Girin : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00695730

Soumis le : mercredi 9 mai 2012-16:50:22

Dernière modification le : jeudi 4 avril 2024-21:05:03

Archivage à long terme le : vendredi 30 novembre 2012-11:30:21

Dates et versions

hal-00695730 , version 1 (09-05-2012)

Identifiants

HAL Id : hal-00695730 , version 1

Citer

Jonathan Pinel, Laurent Girin. "Sparsification" of audio signals using the MDCT/IntMDCT and a psychoacoustic model - Application to informed audio source separation. AES 2011 - 42nd International Conference: Semantic Audio, Jul 2011, Ilmenau, Germany. pp.179-188. ⟨hal-00695730⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS GIPSA GIPSA-DIS GIPSA-DPC GIPSA-MAGIC GIPSA-CICS ANR

149 Consultations

198 Téléchargements

"Sparsification" of audio signals using the MDCT/IntMDCT and a psychoacoustic model - Application to informed audio source separation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager