Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization

Abstract : Acoustic speaker diarization is investigated for situations where a collection of shows from the same source needs to be processed. In this case, the same speaker should receive the same label across all shows. We compare different architectures for cross-show speaker diarization: the obvious concatenation of all shows, a hybrid system combining first a local clustering stage followed by a global clustering stage, and an incremental system which processes the shows in a predefined order and updates the speaker models accordingly. This latter system being best suited to real applicative situations. These three strategies were compared to a baseline single-show system on a set of 46 ten-minutes samples of British English scientific podcasts.
Document type :
Conference papers
Complete list of metadatas

Cited literature [8 references]  Display  Hide  Download
Contributor : Claude Barras <>
Submitted on : Tuesday, January 23, 2018 - 5:13:57 PM
Last modification on : Saturday, May 4, 2019 - 1:21:23 AM
Long-term archiving on : Thursday, May 24, 2018 - 9:36:21 AM


Publisher files allowed on an open archive


  • HAL Id : hal-01690265, version 1



Viet-Anh Tran, Viet Le, Claude Barras, Lori Lamel. Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization. Interspeech 2011, Aug 2011, Florence, Italy. ⟨hal-01690265⟩



Record views


Files downloads