Is Incremental Cross-Show Speaker Diarization Efficient For Processing Large Volumes of Data?

Abstract : Current cross-show diarization systems are mainly based on an overall clustering process which handles all the shows within a collection simultaneously. This approach has already been studied in various situations and seems to be the best way so far to achieve low error rates. However, this process has limits in realistic applicative contexts where large and dynamically increasing collections have to be processed. In this paper we investigate the use of an incremental clustering cross-show speaker diarization architecture to iteratively process new shows within an existing collection. The new shows to be inserted are processed one after another, according to the chronological order of their broadcasting dates. Experiments were conducted on the data distributed for the ETAPE and the REPERE French evaluation campaigns. It consist of 142 hours of data collected from 310 shows, from a period from Sept. 2010 to Oct. 2012.
Type de document :
Communication dans un congrès
Interspeech, 2014, Singapour, Singapore. Interspeech 2014, 2014
Liste complète des métadonnées

Littérature citée [19 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01433257
Contributeur : Sylvain Meignier <>
Soumis le : samedi 1 avril 2017 - 00:40:40
Dernière modification le : jeudi 6 avril 2017 - 10:07:37
Document(s) archivé(s) le : dimanche 2 juillet 2017 - 12:18:51

Fichier

i14_0587.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : hal-01433257, version 1

Collections

Citation

Grégor Dupuy, Sylvain Meignier, Yannick Estève. Is Incremental Cross-Show Speaker Diarization Efficient For Processing Large Volumes of Data?. Interspeech, 2014, Singapour, Singapore. Interspeech 2014, 2014. 〈hal-01433257〉

Partager

Métriques

Consultations de la notice

179

Téléchargements de fichiers

33