Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition

Zhuo Chen; Hélène Papadopoulos; Daniel P.W. Ellis

doi:10.1109/HSCMA.2014.6843242

Conference paper. Year: 2014

Zhuo Chen
Role: Author
labROSA

Hélène Papadopoulos
Role: Author
Laboratoire des signaux et systèmes

Daniel P.W. Ellis
Role: Author
labROSA

Abstract

One powerful approach to speech enhancement employs strong models for both speech and noise, decomposing a mixture into the most likely combination. But if the noise encountered differs significantly from the system's assumptions, performance will suffer. In previous work, we proposed a speech enhancement model that decomposes the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model. This makes few assumptions about the noise, and gave appealing results on small excerpts of noisy speech. However, when processing whole conversations, the foreground speech may vary in its complexity and may be unevenly distributed throughout the recording, resulting in inaccurate decompositions for some segments. In this paper, we explore an adaptive formulation of our previous model that incorporates separate side information to guide the decomposition, making it able to better process entire conversations that may exhibit large variations in the speech content.
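The decomposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a simple penalized objective, 0.5 * ||Y - D H - L||_F^2 + lam_h * ||H||_1 + lam_l * ||L||_*, where Y is a magnitude spectrogram, D a fixed dictionary of speech templates, H the sparse activations, and L the low-rank background. The function names, the parameters `lam_h` and `lam_l`, and the alternating proximal updates are all hypothetical choices made for illustration, and the content-adaptive side information discussed in the paper is left out.

```python
import numpy as np


def soft_threshold(x, tau):
    """Elementwise soft-thresholding: the proximal operator of tau * ||x||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def singular_value_threshold(m, tau):
    """Proximal operator of tau * nuclear norm: shrink the singular values."""
    u, s, vt = np.linalg.svd(m, full_matrices=False)
    return (u * soft_threshold(s, tau)) @ vt


def objective(y, d, acts, low_rank, lam_h, lam_l):
    """Assumed cost: data fit + sparsity of activations + nuclear norm of background."""
    residual = y - d @ acts - low_rank
    nuclear = np.linalg.svd(low_rank, compute_uv=False).sum()
    return 0.5 * np.sum(residual ** 2) + lam_h * np.abs(acts).sum() + lam_l * nuclear


def decompose(y, d, lam_h=0.1, lam_l=1.0, n_iter=200):
    """Alternate a proximal-gradient step on the activations with an exact
    singular-value-thresholding update of the low-rank term (illustrative only)."""
    acts = np.zeros((d.shape[1], y.shape[1]))
    low_rank = np.zeros_like(y)
    # Step size 1 / ||D||_2^2 keeps the activation update from increasing the cost.
    step = 1.0 / np.linalg.norm(d, 2) ** 2
    for _ in range(n_iter):
        grad = d.T @ (d @ acts + low_rank - y)
        acts = soft_threshold(acts - step * grad, step * lam_h)
        low_rank = singular_value_threshold(y - d @ acts, lam_l)
    return acts, low_rank
```

Under these assumptions, `d @ acts` would serve as the speech estimate and `low_rank` as the background estimate. Both updates are non-increasing in the assumed cost, so `objective` can be used to monitor convergence.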

Keywords

low-rank, robust PCA, sparse, speech enhancement, spectrogram decomposition, voice activity detection

Domains

Signal and Image Processing [eess.SP]

Main file

Final_Submitted_HSCMA.pdf (831.15 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01104904

Submitted on: Monday, 19 January 2015, 14:15:34

Last modified on: Sunday, 17 March 2024, 11:46:04

Long-term archiving on: Friday, 11 September 2015, 07:10:22

Dates and versions

hal-01104904, version 1 (19-01-2015)

Identifiers

HAL Id: hal-01104904, version 1
DOI: 10.1109/HSCMA.2014.6843242

Cite

Zhuo Chen, Hélène Papadopoulos, Daniel P.W. Ellis. Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition. IEEE Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), May 2014, Nancy, France. pp. 16-20, ⟨10.1109/HSCMA.2014.6843242⟩. ⟨hal-01104904⟩

Collections

SUPELEC EC-PARIS CNRS SUP_LSS SUP_SIGNAUX UNIV-PARIS-SACLAY
