Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization

Résumé

The aim of this research work is the development of an automatic voice recommendation system for assisted voice casting. In this article, we propose preliminary work on acoustic pair-ing of original and dubbed voices. The voice segments are taken from a video game released in two different languages. The paired voice segments come from different languages but belong to the same video game character. Our wish is to exploit the relationship between a set of paired segments in order to model the perceptual aspects of a given character depending on the target language. We use a state-of-the-art approach in speaker recognition (i.e. based on the paradigm i-vector/PLDA). We first evaluate pairs of i-vectors using two different acoustic spaces, one for each of the targeted languages. Secondly, we perform a transformation in order to project the source-language i-vector into the target language. The results showed that this latest approach is able to improve significantly the accuracy. Finally, we challenge the system ability to model the latent information that holds the video-game character independently of the speaker, the linguistic content and the language .
Fichier principal
Vignette du fichier
main.pdf (228.14 Ko) Télécharger le fichier
confusion.pdf (81.39 Ko) Télécharger le fichier
confusion_matrix.pdf (25.19 Ko) Télécharger le fichier
poster.pdf (1.6 Mo) Télécharger le fichier
presentation.pdf (6.28 Mo) Télécharger le fichier
results.pdf (20.89 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01572151 , version 1 (04-08-2017)

Licence

Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Identifiants

Citer

Adrien Gresse, Mickael Rouvier, Richard Dufour, Vincent Labatut, Jean-Francois Bonastre. Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization. Interspeech, Aug 2017, Stockholm, Sweden. pp.2839-2843, ⟨10.21437/Interspeech.2017-1311⟩. ⟨hal-01572151⟩

Collections

UNIV-AVIGNON LIA
293 Consultations
591 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More