I-vectors and ILP clustering adapted to cross-show speaker diarization

Grégor Dupuy; Mickael Rouvier; Sylvain Meignier; Yannick Estève

Conference paper, Year: 2012

Grégor Dupuy

Role: Author
PersonId : 776540
IdRef : 188635548

Laboratoire d'Informatique de l'Université du Mans

Mickael Rouvier

Role: Author

Laboratoire d'Informatique de l'Université du Mans

Sylvain Meignier

Role: Author
PersonId : 11674
IdHAL : sylvain-meignier
ORCID : 0000-0001-7687-073X
IdRef : 182269086

Laboratoire d'Informatique de l'Université du Mans

Yannick Estève

Role: Author
PersonId : 11645
IdHAL : yannick-esteve
ORCID : 0000-0002-3656-8883
IdRef : 070531668

Laboratoire d'Informatique de l'Université du Mans

Abstract

We study speaker diarization over a collection of audio documents. The goal is to detect speakers who appear in several shows. In our approach, each show in the collection is first processed separately; the collection is then processed as a whole to group speakers who appear in several shows. Two clustering methods are studied for this collection-level processing: one uses the NCLR metric, and the other is inspired by i-vector techniques, which are mainly used in speaker verification. Both methods were evaluated on the whole training corpus of ESTER 2. The i-vector method achieves error rates similar to those of the NCLR method, but its computation is on average 8.66 times faster. It is therefore suitable for processing large volumes of data.
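The abstract does not spell out the ILP formulation, so the following is only a minimal sketch of one common way to pose clustering as an integer linear program: pick a subset of speakers as cluster centers, attach every other speaker to a center no farther than a threshold, and minimize the number of centers plus the normalized sum of distances. The objective, the function name `ilp_cluster_bruteforce`, the threshold `delta`, and the toy distance matrix are all assumptions made for illustration; they are not taken from this record. The sketch solves the tiny instance by exhaustive enumeration rather than with an ILP solver, and the distances stand in for whatever i-vector distance the actual system uses.

```python
from itertools import combinations


def ilp_cluster_bruteforce(dist, delta):
    """Exhaustively solve a tiny center-based clustering problem.

    Assumed objective (illustrative, not confirmed by the source):
        minimize  (#centers) + (1 / delta) * sum of member-to-center distances
    subject to every point being attached to exactly one center
    that lies within distance `delta`.

    dist  : square matrix of pairwise distances, with dist[i][i] == 0
    delta : maximum allowed distance between a point and its center
    Returns (assignment, cost), where assignment[j] is the center of point j.
    """
    n = len(dist)
    best_cost, best_assign = None, None
    for k in range(1, n + 1):
        for centers in combinations(range(n), k):
            cost = float(k)  # one unit of cost per opened center
            assign = []
            feasible = True
            for j in range(n):
                if j in centers:
                    c = j  # a center always represents itself
                else:
                    # for a fixed center set, the nearest center is optimal
                    c = min(centers, key=lambda i: dist[i][j])
                    if dist[c][j] > delta:
                        feasible = False
                        break
                cost += dist[c][j] / delta
                assign.append(c)
            if feasible and (best_cost is None or cost < best_cost):
                best_cost, best_assign = cost, assign
    return best_assign, best_cost


# Hypothetical distances between four per-show speaker clusters:
# 0 and 1 are close, 2 and 3 are close, the two pairs are far apart.
toy_dist = [
    [0.0, 0.2, 0.9, 1.0],
    [0.2, 0.0, 0.8, 0.9],
    [0.9, 0.8, 0.0, 0.1],
    [1.0, 0.9, 0.1, 0.0],
]
assignment, cost = ilp_cluster_bruteforce(toy_dist, delta=0.5)
```

On this toy input the two close pairs end up in two separate clusters. Enumeration is exponential in the number of speakers, so a real cross-show system would hand the same constraints to an ILP solver instead.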

Keywords

speaker diarization; cross-show diarization; i-vectors; ILP clustering

Domains

Computation and Language [cs.CL]

Main file

i12_2174.pdf (239.58 KB)

Origin: Publisher files allowed on an open archive

Contributor: Hakim Amokrane

https://hal.science/hal-01450711

Submitted on: Monday, April 3, 2017, 21:50:48

Last modified on: Tuesday, December 8, 2020, 09:44:15

Long-term archiving on: Tuesday, July 4, 2017, 14:52:13

Dates and versions

hal-01450711, version 1 (03-04-2017)

Identifiers

HAL Id: hal-01450711, version 1

Cite

Grégor Dupuy, Mickael Rouvier, Sylvain Meignier, Yannick Estève. I-vectors and ILP clustering adapted to cross-show speaker diarization. Interspeech, 2012, Portland, Oregon (USA), United States. ⟨hal-01450711⟩

Collections

UNIV-LEMANS LIUM LIUM-LST ANR
