Multi-stage speaker diarization of broadcast news

Abstract: This paper describes recent advances in speaker diarization with a multistage segmentation and clustering system, which incorporates a speaker identification step. This system builds upon the baseline audio partitioner used in the LIMSI broadcast news transcription system. The baseline partitioner provides high cluster purity, but tends to split data from speakers with a large quantity of data into several segment clusters. Several improvements to the baseline system have been made. First, the iterative Gaussian mixture model (GMM) clustering has been replaced by Bayesian information criterion (BIC) agglomerative clustering. Second, an additional clustering stage has been added, using a GMM-based speaker identification method. Finally, a post-processing stage refines the segment boundaries using the output of a transcription system. On the National Institute of Standards and Technology (NIST) RT-04F and ESTER evaluation data, the multistage system reduces the speaker error by over 70% relative to the baseline system, and by 40% to 50% relative to a single-stage BIC clustering system.
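The BIC agglomerative clustering mentioned in the abstract can be illustrated with a minimal sketch of a merge criterion. This is not the authors' implementation: it assumes each cluster is modeled by a single full-covariance Gaussian, that the penalty uses the combined size of the two clusters (a "local" BIC), and that the weight `lam` and the function names `log_det_cov` and `delta_bic` are hypothetical choices made for illustration. Under these assumptions, two clusters are merge candidates when the returned value is negative.

```python
import numpy as np


def log_det_cov(frames):
    """Log-determinant of the maximum-likelihood covariance of a frame matrix.

    frames has shape (n_frames, n_dims).
    """
    cov = np.atleast_2d(np.cov(frames, rowvar=False, bias=True))
    _sign, logdet = np.linalg.slogdet(cov)
    return logdet


def delta_bic(frames_i, frames_j, lam=1.0):
    """Delta-BIC between modeling two clusters separately or as one Gaussian.

    A negative value favors merging. The penalty weight lam and the use of
    the combined cluster size in the penalty are assumptions of this sketch.
    """
    n_i, n_j = len(frames_i), len(frames_j)
    n = n_i + n_j
    d = frames_i.shape[1]
    merged = np.vstack([frames_i, frames_j])

    # Loss in log-likelihood caused by replacing two Gaussians with one.
    likelihood_term = 0.5 * (
        n * log_det_cov(merged)
        - n_i * log_det_cov(frames_i)
        - n_j * log_det_cov(frames_j)
    )
    # Parameters saved by the merge: one mean (d) and one covariance.
    n_params = d + d * (d + 1) / 2
    penalty = 0.5 * lam * n_params * np.log(n)
    return likelihood_term - penalty


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    same_a = rng.normal(0.0, 1.0, size=(200, 2))
    same_b = rng.normal(0.0, 1.0, size=(200, 2))
    other = rng.normal(10.0, 1.0, size=(200, 2))
    print("same source:     ", delta_bic(same_a, same_b))
    print("different source:", delta_bic(same_a, other))
```

An agglomerative pass would repeatedly merge the pair of clusters with the lowest value and stop once no pair scores below zero; the feature extraction and stopping details of the actual system are not given in the abstract.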
Document type:
Journal article
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006, 14 (5), 〈10.1109/TASL.2006.878261〉

https://hal.archives-ouvertes.fr/hal-01434241
Contributor: Sylvain Meignier
Submitted on: Wednesday, March 22, 2017 - 17:14:10
Last modified on: Monday, March 18, 2019 - 16:23:58
Document(s) archived on: Friday, June 23, 2017 - 12:33:50

File

sap_rt_diarization.pdf
Files produced by the author(s)

Citation

Claude Barras, Xuan Zhu, Sylvain Meignier, Jean-Luc Gauvain. Multi-stage speaker diarization of broadcast news. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006, 14 (5), 〈10.1109/TASL.2006.878261〉. 〈hal-01434241〉
