Similarity Metric Based on Siamese Neural Networks for Voice Casting

Adrien Gresse; Mathias Quillot; Richard Dufour; Vincent Labatut; Jean-François Bonastre

doi:10.1109/ICASSP.2019.8683178

Communication Dans Un Congrès Année : 2019

Similarity Metric Based on Siamese Neural Networks for Voice Casting

(1) , (1) , (1) , (1) , (1)

Adrien Gresse

Fonction : Auteur
PersonId : 172309
IdHAL : adrien-gresse

Laboratoire Informatique d'Avignon

Mathias Quillot

Fonction : Auteur

Laboratoire Informatique d'Avignon

Richard Dufour

Fonction : Auteur
PersonId : 178348
IdHAL : richard-dufour
ORCID : 0000-0003-1203-9108

Laboratoire Informatique d'Avignon

Vincent Labatut

Fonction : Auteur
PersonId : 482
IdHAL : vlabatut
ORCID : 0000-0002-2619-2835
IdRef : 076951375

Laboratoire Informatique d'Avignon

Jean-François Bonastre

Fonction : Auteur
PersonId : 172421
IdHAL : jean-francois-bonastre
ORCID : 0000-0001-7741-3346
IdRef : 079112978

Laboratoire Informatique d'Avignon

Résumé

Dubbing contributes to a larger international distribution of multi-media documents. It aims to replace the original voice in a source language by a new one in a target language. For now, the target voice selection procedure, called voice casting, is manually performed by human experts. This selection is not exclusively based on acoustic similarity between the two voices. Actually, it is also supported by more subjective criteria such as the "color" of the voice, socio-cultural choices... The objective of this work is to model a voice similarity metric able to embed all the concerned voice characteristics , including the observers' receptive interests. In this paper, we propose a Siamese Neural Networks-based approach, measuring proximity between the original and dubbed voices. We propose an adapted jackknifing cross-validation method to evaluate our similarity model on unseen voices. The results show that we successfully capture information allowing two voices to be associated, with respect to the character's or role's abstract dimension.

Mots clés

Voice casting Similarity metric Siamese net-works i-vector

Domaines

Machine Learning [stat.ML]

Fichier principal

Siamese_neural_networks_based_similarity_metric_for_voice_casting.pdf (195.42 Ko)

Poster_ICASSP19.pdf (377.87 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Poster

Adrien Gresse : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02004762

Soumis le : samedi 2 février 2019-12:36:14

Dernière modification le : jeudi 1 février 2024-14:23:51

Dates et versions

hal-02004762 , version 1 (02-02-2019)

Identifiants

HAL Id : hal-02004762 , version 1
DOI : 10.1109/ICASSP.2019.8683178

Citer

Adrien Gresse, Mathias Quillot, Richard Dufour, Vincent Labatut, Jean-François Bonastre. Similarity Metric Based on Siamese Neural Networks for Voice Casting. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2019, Brighton, United Kingdom. pp.6585-6589, ⟨10.1109/ICASSP.2019.8683178⟩. ⟨hal-02004762⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON LIA ANR

127 Consultations

1153 Téléchargements

Similarity Metric Based on Siamese Neural Networks for Voice Casting

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager