Step-by-step and integrated approaches in broadcast news speaker diarization

Abstract : This paper summarizes the collaboration of the LIA and CLIPS laboratories on speaker diarization of broadcast news during the spring NIST Rich Transcription 2003 evaluation campaign (NIST-RTÕ03S). The speaker diarization task consists of segmenting a conversation into homogeneous segments which are then grouped into speaker classes. Two approaches are described and compared for speaker diarization. The first one relies on a classical two-step speaker diarization strategy based on a detection of speaker turns followed by a clustering process, while the second one uses an integrated strategy where both segment boundaries and speaker tying of the segments are extracted simultaneously and challenged during the whole process. These two methods are used to investigate various strategies for the fusion of diarization results. Furthermore, segmentation into acoustic macro-classes is proposed and evaluated as a priori step to speaker diarization. The objective is to take advantage of the a priori acoustic information in the diariza-tion process. Along with enriching the resulting segmentation with information about speaker gender,
Type de document :
Article dans une revue
Computer Speech and Language, Elsevier, 2006, Odyssey 2004: The speaker and Language Recognition Workshop Odyssey-04, Odyssey 2004: The speaker and Language Recognition Workshop, 20 (2-3), pp.303-330. 〈10.1016/j.csl.2005.08.002〉
Liste complète des métadonnées

Littérature citée [31 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01318554
Contributeur : Bibliothèque Universitaire Déposants Hal-Avignon <>
Soumis le : vendredi 24 mars 2017 - 23:29:50
Dernière modification le : samedi 23 mars 2019 - 01:22:40
Document(s) archivé(s) le : dimanche 25 juin 2017 - 12:32:41

Fichier

lia-clips.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Sylvain Meignier, Daniel Moraru, Corinne Fredouille, Jean-François Bonastre, Laurent Besacier. Step-by-step and integrated approaches in broadcast news speaker diarization. Computer Speech and Language, Elsevier, 2006, Odyssey 2004: The speaker and Language Recognition Workshop Odyssey-04, Odyssey 2004: The speaker and Language Recognition Workshop, 20 (2-3), pp.303-330. 〈10.1016/j.csl.2005.08.002〉. 〈hal-01318554〉

Partager

Métriques

Consultations de la notice

448

Téléchargements de fichiers

209