An active learning method for speaker identity annotation in audio recordings

Pierre-Alexandre Broux; David Doukhan; Simon Petitrenaud; Sylvain Meignier; Jean Carrive

Communication Dans Un Congrès Année : 2016

An active learning method for speaker identity annotation in audio recordings

(1) , (1) , (2) , (2) , (1)

1
2

Pierre-Alexandre Broux

Fonction : Auteur
PersonId : 176980
IdHAL : pabroux

Institut National de l'Audiovisuel

David Doukhan

Fonction : Auteur

Institut National de l'Audiovisuel

Simon Petitrenaud

Fonction : Auteur
PersonId : 16717
IdHAL : simon-petitrenaud
ORCID : 0000-0003-2545-8379
IdRef : 183687515

Laboratoire d'Informatique de l'Université du Mans

Sylvain Meignier

Fonction : Auteur

Laboratoire d'Informatique de l'Université du Mans

Jean Carrive

Fonction : Auteur

Institut National de l'Audiovisuel

Résumé

Given that manual annotation of speech is an expensive and long process, we attempt in this paper to assist an anno-tator to perform a speaker diarization. This assistance takes place in an annotation background for a large amount of archives. We propose a method which decreases the intervention number of a human. This method corrects a diarization by taking into account the human interventions. The experiment is done using French broadcast TV shows drawn from ANR-REPERE evaluation campaign. Our method is mainly evaluated in terms of KSR (Keystroke Saving Rate), and we reduce the number of actions needed to correct a speaker diarization output by 6.8% in absolute value.

Domaines

Informatique et langage [cs.CL]

Fichier principal

paper5.pdf (377.59 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

HAKIM AMOKRANE : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01451532

Soumis le : jeudi 6 avril 2017-09:02:16

Dernière modification le : mercredi 9 octobre 2019-11:44:04

Archivage à long terme le : vendredi 7 juillet 2017-12:21:16

Dates et versions

hal-01451532 , version 1 (06-04-2017)

Identifiants

HAL Id : hal-01451532 , version 1

Citer

Pierre-Alexandre Broux, David Doukhan, Simon Petitrenaud, Sylvain Meignier, Jean Carrive. An active learning method for speaker identity annotation in audio recordings. 1st International Workshop on Multimodal Media Data Analytics (MMDA 2016), Aug 2016, La Haye, Netherlands. ⟨hal-01451532⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LEMANS LIUM LIUM-LST

221 Consultations

276 Téléchargements

An active learning method for speaker identity annotation in audio recordings

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager