Towards Lifelong Human Assisted Speaker Diarization - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Computer Speech and Language Année : 2023

Towards Lifelong Human Assisted Speaker Diarization

Résumé

This paper introduces the resources necessary to develop and evaluate human assisted lifelong learning speaker diarization systems. It describes the ALLIES corpus and associated protocols, especially designed for diarization of a collection audio recordings across time. This dataset is compared to existing corpora and the performances of three baseline systems, based on x-vectors, i-vectors and VBxHMM, are reported for reference. Those systems are then extended to include an active correction process that efficiently guides a human annotator to improve the automatically generated hypotheses. An open-source simulated human expert is provided to ensure reproducibility of the human assisted correction process and its fair evaluation. An exhaustive evaluation, of the human assisted correction shows the high potential of this approach. The ALLIES corpus, a baseline system including the active correction module and all evaluation tools are made freely available to the scientific community.
Fichier principal
Vignette du fichier
Towards Lifelong Human Assisted Speaker Diarization.pdf (868.89 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03737796 , version 1 (29-07-2022)

Identifiants

Citer

Meysam Shamsi, Anthony Larcher, Loïc Barrault, Sylvain Meignier, Yevheni Prokopalo, et al.. Towards Lifelong Human Assisted Speaker Diarization. Computer Speech and Language, 2023, 77, pp.101437. ⟨10.1016/j.csl.2022.101437⟩. ⟨hal-03737796⟩
106 Consultations
135 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More