Generalized Wiener filtering with fractional power spectrograms

Antoine Liutkus; Roland Badeau

Communication Dans Un Congrès Année : 2015

Generalized Wiener filtering with fractional power spectrograms

(1, 2) , (3)

1
2
3

Antoine Liutkus

Fonction : Auteur
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Speech Modeling for Facilitating Oral-Based Communication

Analysis, perception and recognition of speech

Roland Badeau

Fonction : Auteur
PersonId : 1121
IdHAL : rbadeau
ORCID : 0000-0002-9630-6877
IdRef : 106938134

Télécom ParisTech

Résumé

In the recent years, many studies have focused on the single-sensor separation of independent waveforms using so-called soft-masking strategies, where the short term Fourier transform of the mixture is multiplied element-wise by a ratio of spectrogram models. When the signals are wide-sense stationary, this strategy is theoretically justified as an optimal Wiener filtering: the power spectrograms of the sources are supposed to add up to yield the power spectrogram of the mixture. However, experience shows that using fractional spectrograms instead, such as the amplitude, yields good performance in practice, because they experimentally better fit the additivity assumption. To the best of our knowledge, no probabilistic interpretation of this filtering procedure was available to date. In this paper, we show that assuming the additivity of fractional spectrograms for the purpose of building soft-masks can be understood as separating locally stationary alpha-stable harmonizable processes, alpha-harmonizable in short, thus justifying the procedure theoretically.

Mots clés

harmonizable processes Audio source separation probability theory alpha-stable random variables soft-masks

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP] Recherche d'information [cs.IR]

Fichier principal

ICASSP-harmonizable2.pdf (234.01 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Antoine Liutkus : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01110028

Soumis le : jeudi 18 juin 2015-16:40:36

Dernière modification le : jeudi 1 février 2024-10:03:39

Archivage à long terme le : mardi 25 avril 2017-18:14:07

Dates et versions

hal-01110028 , version 1 (10-02-2015)

hal-01110028 , version 2 (18-06-2015)

Identifiants

HAL Id : hal-01110028 , version 2

Citer

Antoine Liutkus, Roland Badeau. Generalized Wiener filtering with fractional power spectrograms. 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. ⟨hal-01110028v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA IRISA PARISTECH UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UNIV-PARIS-SACLAY UR1-UFR-ISTIC UNIV-RENNES LTCI IDS S2A ANR UR1-MATH-NUM

658 Consultations

1351 Téléchargements

Generalized Wiener filtering with fractional power spectrograms

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager