Iterative PLDA Adaptation for Speaker Diarization

Gaël Le Lan; Delphine Charlet; Anthony Larcher; Sylvain Meignier

doi:10.21437/Interspeech.2016-572

Communication Dans Un Congrès Année : 2016

Iterative PLDA Adaptation for Speaker Diarization

(1, 2) , (2) , (1) , (1)

1
2

Gaël Le Lan

Fonction : Auteur
PersonId : 751878
IdHAL : gael-le-lan
ORCID : 0000-0002-1493-5777

Laboratoire d'Informatique de l'Université du Mans

Orange Labs [Lannion]

Delphine Charlet

Fonction : Auteur

Orange Labs [Lannion]

Anthony Larcher

Fonction : Auteur
PersonId : 20105
IdHAL : anthony-larcher
ORCID : 0000-0003-4398-0224
IdRef : 139544569

Laboratoire d'Informatique de l'Université du Mans

Sylvain Meignier

Fonction : Auteur
PersonId : 11674
IdHAL : sylvain-meignier
ORCID : 0000-0001-7687-073X
IdRef : 182269086

Laboratoire d'Informatique de l'Université du Mans

Résumé

This paper investigates iterative PLDA adaptation for cross-show speaker diarization applied to small collections of French TV archives based on an i-vector framework. Using the target collection itself for unsupervised adaptation, PLDA parameters are iteratively tuned while score normalization is applied for convergence. Performances are compared, using combinations of target and external data for training and adaptation. The experiments on two distinct target corpora show that the proposed framework can gradually improve an existing system trained on external annotated data. Such results indicate that performing speaker diarization on small collections of unlabeled audio archives should only rely on the availability of a sufficient boot-strap system, which can be incrementally adapted to every target collection. The proposed framework also widens the range of acceptable speaker clustering thresholds for a given performance objective.

Mots clés

speaker diarization PLDA unsupervised train- ing domain adaptation iterative training

Domaines

Informatique et langage [cs.CL]

Fichier principal

0572b.pdf (1.84 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

sylvain meignier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01433172

Soumis le : jeudi 30 mars 2017-22:48:06

Dernière modification le : mercredi 19 janvier 2022-12:00:02

Dates et versions

hal-01433172 , version 1 (30-03-2017)

Identifiants

HAL Id : hal-01433172 , version 1
DOI : 10.21437/Interspeech.2016-572

Citer

Gaël Le Lan, Delphine Charlet, Anthony Larcher, Sylvain Meignier. Iterative PLDA Adaptation for Speaker Diarization. Interspeech 2016, Sep 2016, San Francisco, United States. pp.2175 - 2179, ⟨10.21437/Interspeech.2016-572⟩. ⟨hal-01433172⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LEMANS LIUM LIUM-LST

664 Consultations

315 Téléchargements

Iterative PLDA Adaptation for Speaker Diarization

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager