Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization - HAL open archive
Conference paper, Year: 2011

Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization

Abstract

Acoustic speaker diarization is investigated for situations where a collection of shows from the same source needs to be processed. In this case, the same speaker should receive the same label across all shows. We compare different architectures for cross-show speaker diarization: the obvious concatenation of all shows, a hybrid system that applies a local clustering stage followed by a global clustering stage, and an incremental system which processes the shows in a predefined order and updates the speaker models accordingly. The latter system is best suited to real application scenarios. These three strategies were compared to a baseline single-show system on a set of 46 ten-minute samples of British English scientific podcasts.
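
To make the incremental strategy concrete, here is a minimal Python sketch. It is an illustration only, not the authors' system: the abstract does not specify the speaker models or the similarity measure, so this sketch assumes each local cluster is summarized by a non-zero centroid vector and matched to global speakers by cosine distance. The function name `incremental_cross_show`, the `threshold` parameter, and the running-mean update are all hypothetical choices.

```python
import math


def cosine_distance(a, b):
    """Cosine distance between two non-zero vectors (assumed representation)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def incremental_cross_show(shows, threshold=0.1):
    """Process shows in the given order and assign global speaker ids.

    shows: list of dicts, one per show, mapping a local cluster label
           to its centroid vector (hypothetical input format).
    Returns a list of dicts mapping each local label to a global id.
    """
    global_models = []  # each entry: [centroid, number_of_merged_clusters]
    mappings = []
    for show in shows:
        show_map = {}
        for local_label, centroid in show.items():
            # Find the closest existing global speaker under the threshold.
            best_id, best_dist = None, threshold
            for gid, (g_centroid, _) in enumerate(global_models):
                dist = cosine_distance(centroid, g_centroid)
                if dist < best_dist:
                    best_id, best_dist = gid, dist
            if best_id is None:
                # No match: this local cluster starts a new global speaker.
                global_models.append([list(centroid), 1])
                best_id = len(global_models) - 1
            else:
                # Match: update the global model with a running mean.
                g_centroid, n = global_models[best_id]
                updated = [(g * n + c) / (n + 1)
                           for g, c in zip(g_centroid, centroid)]
                global_models[best_id] = [updated, n + 1]
            show_map[local_label] = best_id
        mappings.append(show_map)
    return mappings
```

In this sketch, a speaker who appears in a later show reuses the global id created in an earlier show, while unseen speakers receive new ids. Because shows are handled one at a time, earlier shows never need to be reprocessed, which is the property that makes an incremental design attractive when new shows keep arriving.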
Main file
i11_1053.pdf (277.03 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-01690265, version 1 (23-01-2018)

Identifiers

  • HAL Id: hal-01690265, version 1

Cite

Viet-Anh Tran, Viet Bac Le, Claude Barras, Lori Lamel. Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization. Interspeech 2011, Aug 2011, Florence, Italy. ⟨hal-01690265⟩
