"Sheldon speaking, bonjour!": Leveraging Multilingual Tracks for (Weakly) Supervised Speaker Identification - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

"Sheldon speaking, bonjour!": Leveraging Multilingual Tracks for (Weakly) Supervised Speaker Identification

Résumé

We address the problem of speaker identification in multimedia data, and TV series in particular. While speaker identification is traditionally a supervised machine-learning task, our first contribution is to significantly reduce the need for costly preliminary manual annotations through the use of automatically aligned (and potentially noisy) fan-generated transcripts and subtitles. We show that both speech activity detection and speech turn identification modules trained in this weakly supervised manner achieve similar performance as their fully supervised counterparts (i.e. relying on fine manual speech/non-speech/speaker annotation). Our second contribution relates to the use of multilingual audio tracks usually available with this kind of content to significantly improve the overall speaker identification performance. Reproducible experiments (including dataset, manual annotations and source code) performed on the first six episodes of The Big Bang Theory TV series show that combining the French audio track (containing dubbed actor voices) with the English one (with the original actor voices) improves the overall English speaker identification performance by 5% absolute and up to 70% relative on the five main characters.
Fichier non déposé

Dates et versions

hal-01987812 , version 1 (21-01-2019)

Identifiants

  • HAL Id : hal-01987812 , version 1

Citer

Hervé Bredin, Anindya Roy, Nicolas Pécheux, Alexandre Allauzen. "Sheldon speaking, bonjour!": Leveraging Multilingual Tracks for (Weakly) Supervised Speaker Identification. ACM MM 2014, 22nd ACM International Conference on Multimedia, 2014, Orlando, United States. ⟨hal-01987812⟩
24 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More