Iterative PLDA Adaptation for Speaker Diarization - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Iterative PLDA Adaptation for Speaker Diarization

Résumé

This paper investigates iterative PLDA adaptation for cross-show speaker diarization applied to small collections of French TV archives based on an i-vector framework. Using the target collection itself for unsupervised adaptation, PLDA parameters are iteratively tuned while score normalization is applied for convergence. Performances are compared, using combinations of target and external data for training and adaptation. The experiments on two distinct target corpora show that the proposed framework can gradually improve an existing system trained on external annotated data. Such results indicate that performing speaker diarization on small collections of unlabeled audio archives should only rely on the availability of a sufficient boot-strap system, which can be incrementally adapted to every target collection. The proposed framework also widens the range of acceptable speaker clustering thresholds for a given performance objective.
Fichier principal
Vignette du fichier
0572b.pdf (1.84 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01433172 , version 1 (30-03-2017)

Identifiants

Citer

Gaël Le Lan, Delphine Charlet, Anthony Larcher, Sylvain Meignier. Iterative PLDA Adaptation for Speaker Diarization. Interspeech 2016, Sep 2016, San Francisco, United States. pp.2175 - 2179, ⟨10.21437/Interspeech.2016-572⟩. ⟨hal-01433172⟩
664 Consultations
315 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More