Skip to Main content Skip to Navigation
Conference papers

First investigations on self trained speaker diarization

Abstract : This paper investigates self trained cross-show speaker diarization applied to collections of French TV archives, based on an i-vector/PLDA framework. The parameters used for i-vectors extraction and PLDA scoring are trained in a unsupervised way, using the data of the collection itself. Performances are compared, using combinations of target data and external data for training. The experimental results on two distinct target cor- pora show that using data from the corpora themselves to perform unsupervised iterative training and domain adaptation of PLDA parameters can improve an existing system, trained on external annotated data. Such results indicate that perform- ing speaker indexation on small collections of unlabeled audio archives should only rely on the availability of a sufficient external corpus, which can be specifically adapted to every target collection. We show that a minimum collection size is required to exclude the use of such an external bootstrap.
Complete list of metadatas

Cited literature [19 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01433173
Contributor : Sylvain Meignier <>
Submitted on : Friday, March 24, 2017 - 10:59:42 PM
Last modification on : Tuesday, June 19, 2018 - 11:50:04 AM
Document(s) archivé(s) le : Sunday, June 25, 2017 - 12:32:14 PM

File

50.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01433173, version 1

Collections

Citation

Gaël Le Lan, Sylvain Meignier, Delphine Charlet, Anthony Larcher. First investigations on self trained speaker diarization. Speaker and Language Recognition Workshop (Speaker Odyssey), Jun 2016, Bilbao, Spain. pp.152-157. ⟨hal-01433173⟩

Share

Metrics

Record views

331

Files downloads

182