Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization

Adrien Gresse; Mickael Rouvier; Richard Dufour; Vincent Labatut; Jean-Francois Bonastre

doi:10.21437/Interspeech.2017-1311

Communication Dans Un Congrès Année : 2017

Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization

(1) , (1) , (1) , (1) , (1)

Adrien Gresse

Fonction : Auteur
PersonId : 172309
IdHAL : adrien-gresse

Laboratoire Informatique d'Avignon

Mickael Rouvier

Fonction : Auteur

Laboratoire Informatique d'Avignon

Richard Dufour

Fonction : Auteur
PersonId : 178348
IdHAL : richard-dufour
ORCID : 0000-0003-1203-9108

Laboratoire Informatique d'Avignon

Vincent Labatut

Fonction : Auteur
PersonId : 482
IdHAL : vlabatut
ORCID : 0000-0002-2619-2835
IdRef : 076951375

Laboratoire Informatique d'Avignon

Jean-Francois Bonastre

Fonction : Auteur
PersonId : 172421
IdHAL : jean-francois-bonastre
ORCID : 0000-0001-7741-3346
IdRef : 079112978

Laboratoire Informatique d'Avignon

Résumé

The aim of this research work is the development of an automatic voice recommendation system for assisted voice casting. In this article, we propose preliminary work on acoustic pair-ing of original and dubbed voices. The voice segments are taken from a video game released in two different languages. The paired voice segments come from different languages but belong to the same video game character. Our wish is to exploit the relationship between a set of paired segments in order to model the perceptual aspects of a given character depending on the target language. We use a state-of-the-art approach in speaker recognition (i.e. based on the paradigm i-vector/PLDA). We first evaluate pairs of i-vectors using two different acoustic spaces, one for each of the targeted languages. Secondly, we perform a transformation in order to project the source-language i-vector into the target language. The results showed that this latest approach is able to improve significantly the accuracy. Finally, we challenge the system ability to model the latent information that holds the video-game character independently of the speaker, the linguistic content and the language .

Mots clés

Speaker recognition Voice casting Voice similarity i-vector Video game

Domaines

Son [cs.SD] Informatique et langage [cs.CL]

Fichier principal

main.pdf (228.14 Ko)

confusion.pdf (81.39 Ko)

confusion_matrix.pdf (25.19 Ko)

poster.pdf (1.6 Mo)

presentation.pdf (6.28 Mo)

results.pdf (20.89 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Adrien Gresse : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01572151

Soumis le : vendredi 4 août 2017-18:45:21

Dernière modification le : vendredi 12 novembre 2021-11:18:03

Dates et versions

hal-01572151 , version 1 (04-08-2017)

Licence

Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Identifiants

HAL Id : hal-01572151 , version 1
DOI : 10.21437/Interspeech.2017-1311

Citer

Adrien Gresse, Mickael Rouvier, Richard Dufour, Vincent Labatut, Jean-Francois Bonastre. Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization. Interspeech, Aug 2017, Stockholm, Sweden. pp.2839-2843, ⟨10.21437/Interspeech.2017-1311⟩. ⟨hal-01572151⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON LIA

293 Consultations

591 Téléchargements

Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager