An EM Algorithm for Joint Source Separation and Diarisation of Multichannel Convolutive Speech Mixtures

We present a probabilistic model for joint source separation and diarisation of multichannel convolutive speech mixtures. We build upon the framework of local Gaussian model (LGM) with non-negative matrix factorization (NMF). The diarisa-tion is introduced as a temporal labeling of each source in the mix as active or inactive at the short-term frame level. We devise an EM algorithm in which the source separation process is aided by the diarisation state, since the latter indicates the sources actually present in the mixture. The diarisation state is tracked with a Hidden Markov Model (HMM) with emission probabilities calculated from the estimated source signals. The proposed EM has separation performance comparable with a state-of-the-art LGM NMF method, while out-performing a state-of-the-art speaker diarisation pipeline.

Mots clés

speaker diarisation local Gaussian model Audio source separation

Domaines

Son [cs.SD] Traitement du signal et de l'image [eess.SP] Apprentissage [cs.LG]

Fichier principal

diarisation_camready.pdf (266.05 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Perception team : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01430761

Soumis le : mardi 10 janvier 2017-11:24:31

Dernière modification le : jeudi 4 avril 2024-21:20:15

Archivage à long terme le : mardi 11 avril 2017-14:14:03

Dates et versions

hal-01430761 , version 1 (10-01-2017)

Identifiants

HAL Id : hal-01430761 , version 1
DOI : 10.1109/ICASSP.2017.7951789

Citer

Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud. An EM Algorithm for Joint Source Separation and Diarisation of Multichannel Convolutive Speech Mixtures. ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp.16-20, ⟨10.1109/ICASSP.2017.7951789⟩. ⟨hal-01430761⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA GIPSA GIPSA-DPC LJK LJK_GI LJK_GI_PERCEPTION GIPSA-CRISSP INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

773 Consultations

465 Téléchargements