Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition

Abstract : One powerful approach to speech enhancement employs strong models for both speech and noise, decomposing a mixture into the most likely combination. But if the noise encountered differs significantly from the system's assumptions, performance will suffer. In previous work, we proposed a speech enhancement model that decomposes the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model. This makes few assumptions about the noise, and gave appealing results on small excerpts of noisy speech. However, when processing whole conversations, the foreground speech may vary in its complexity and may be unevenly distributed throughout the recording, resulting in inaccurate decompositions for some segments. In this paper, we explore an adaptive formulation of our previous model that incorporates separate side information to guide the decomposition, making it able to better process entire conversations that may exhibit large variations in the speech content.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [30 references]  Display  Hide  Download
Contributor : I Papadopoulos <>
Submitted on : Monday, January 19, 2015 - 2:15:34 PM
Last modification on : Friday, July 27, 2018 - 7:09:23 PM
Document(s) archivé(s) le : Friday, September 11, 2015 - 7:10:22 AM


Files produced by the author(s)




Zhuo Chen, Hélène Papadopoulos, Daniel P.W. Ellis. Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition. IEEE Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), May 2014, Nancy, France. pp.16-20, ⟨10.1109/HSCMA.2014.6843242⟩. ⟨hal-01104904⟩



Record views


Files downloads