
Speaker Diarization With Unsupervised Training Framework

Abstract: This paper investigates single-show and cross-show speaker diarization based on an unsupervised i-vector framework, evaluated on French TV and radio corpora. The framework uses speaker clustering to automatically select data from unlabeled corpora for training i-vector PLDA models. The performance of supervised and unsupervised models is compared. Experimental results on two distinct test corpora (one TV, one radio) show that unsupervised models perform as well as supervised models on both tasks. These results indicate that, in the future, effective cross-show diarization on data from a new language or a new domain should not depend on the availability of manually annotated data.
Document type: Conference papers

Cited literature: 19 references
Contributor: Sylvain Meignier
Submitted on: Wednesday, March 22, 2017 - 12:18:11 AM
Last modification on: Wednesday, January 19, 2022 - 12:00:02 PM
Long-term archiving on: Friday, June 23, 2017 - 12:34:26 PM


Gaël Le Lan, Sylvain Meignier, Delphine Charlet, Paul Deléglise. Speaker Diarization With Unsupervised Training Framework. 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Mar 2016, Shanghai, China. pp.5, ⟨10.1109/ICASSP.2016.7472741⟩. ⟨hal-01433167⟩


