Phone adaptive training for speaker diarization - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Phone adaptive training for speaker diarization

Simon Bozonnet
  • Fonction : Auteur
  • PersonId : 903988
Nicholas Evans
  • Fonction : Auteur
  • PersonId : 938450

Résumé

The linguistic content of a speech signal is a source of unwanted variation which can degrade speaker diarization performance. This paper presents our latest work to reduce its impact. The new approach, referred to as Phone Adaptive Training (PAT), is analogous to speaker adaptive training used in automatic speech recognition. We report an oracle experiment which shows that PAT has the potential to deliver a 33% relative improvement in the diarization error rate of our baseline system. Practical experiments show significant improvements across two standard, independent evaluation datasets.
Fichier principal
Vignette du fichier
Phone_Adaptive_Training_for_Speaker_Diarization_7_1_.pdf (222.22 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00733385 , version 1 (18-09-2012)

Identifiants

  • HAL Id : hal-00733385 , version 1

Citer

Simon Bozonnet, Ravichander Vipperla, Nicholas Evans. Phone adaptive training for speaker diarization. INTERSPEECH 2012, Sep 2012, Portland, U.S. Outlying Islands. pp.1. ⟨hal-00733385⟩

Collections

EURECOM
73 Consultations
86 Téléchargements

Partager

Gmail Facebook X LinkedIn More