Is Incremental Cross-Show Speaker Diarization Efficient For Processing Large Volumes of Data?

Grégor Dupuy; Sylvain Meignier; Yannick Estève

Communication Dans Un Congrès Année : 2014

Is Incremental Cross-Show Speaker Diarization Efficient For Processing Large Volumes of Data?

(1) , (1) , (1)

Grégor Dupuy

Fonction : Auteur
PersonId : 776540
IdRef : 188635548

Laboratoire d'Informatique de l'Université du Mans

Sylvain Meignier

Fonction : Auteur
PersonId : 11674
IdHAL : sylvain-meignier
ORCID : 0000-0001-7687-073X
IdRef : 182269086

Laboratoire d'Informatique de l'Université du Mans

Yannick Estève

Fonction : Auteur
PersonId : 11645
IdHAL : yannick-esteve
ORCID : 0000-0002-3656-8883
IdRef : 070531668

Laboratoire d'Informatique de l'Université du Mans

Résumé

Current cross-show diarization systems are mainly based on an overall clustering process which handles all the shows within a collection simultaneously. This approach has already been studied in various situations and seems to be the best way so far to achieve low error rates. However, this process has limits in realistic applicative contexts where large and dynamically increasing collections have to be processed. In this paper we investigate the use of an incremental clustering cross-show speaker diarization architecture to iteratively process new shows within an existing collection. The new shows to be inserted are processed one after another, according to the chronological order of their broadcasting dates. Experiments were conducted on the data distributed for the ETAPE and the REPERE French evaluation campaigns. It consist of 142 hours of data collected from 310 shows, from a period from Sept. 2010 to Oct. 2012.

Mots clés

speaker diarization incremental architecture cross-show ILP clustering i-vectors

Domaines

Informatique et langage [cs.CL]

Fichier principal

i14_0587.pdf (351.36 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

sylvain meignier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01433257

Soumis le : samedi 1 avril 2017-00:40:40

Dernière modification le : mardi 8 décembre 2020-09:44:14

Archivage à long terme le : dimanche 2 juillet 2017-12:18:51

Dates et versions

hal-01433257 , version 1 (01-04-2017)

Identifiants

HAL Id : hal-01433257 , version 1

Citer

Grégor Dupuy, Sylvain Meignier, Yannick Estève. Is Incremental Cross-Show Speaker Diarization Efficient For Processing Large Volumes of Data?. Interspeech, 2014, Singapour, Singapore. ⟨hal-01433257⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LEMANS LIUM LIUM-LST ANR

150 Consultations

54 Téléchargements

Is Incremental Cross-Show Speaker Diarization Efficient For Processing Large Volumes of Data?

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager