Speaker information modification in the VoicePrivacy 2020 toolchain

Pierre Champion; Denis Jouvet; Anthony Larcher

Report (Research Report), Year: 2020


Pierre Champion

Role: Author

Speech Modeling for Facilitating Oral-Based Communication

Denis Jouvet

Role: Author
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Speech Modeling for Facilitating Oral-Based Communication

Anthony Larcher

Role: Author
PersonId : 20105
IdHAL : anthony-larcher
ORCID : 0000-0003-4398-0224
IdRef : 139544569

Laboratoire d'Informatique de l'Université du Mans

Abstract

This paper presents a study of the baseline system of the VoicePrivacy 2020 challenge. The baseline relies on a voice conversion system that aims to separate speaker identity from linguistic content in a given speech utterance. To generate an anonymized speech waveform, a neural acoustic model and a neural waveform model combine the extracted linguistic content with a selected pseudo-speaker identity. The linguistic content is estimated using bottleneck features extracted from a triphone classifier, while the speaker information is extracted as an x-vector and then modified to target a pseudo-speaker identity in the x-vector space. In this work, we first propose replacing the triphone-based bottleneck feature extractor, which requires supervised training, with an end-to-end Automatic Speech Recognition (ASR) system. Within this framework, we explore the use of adversarial and semi-adversarial training to learn linguistic features while masking speaker information. Finally, we explore several anonymization schemes to determine which module benefits the most from the generated pseudo-speaker identities.
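To make the idea of targeting a pseudo-speaker identity in the x-vector space more concrete, here is a minimal sketch of one possible selection strategy. It is an illustration under stated assumptions, not the procedure described in the report: the function names, the use of cosine distance, the "farthest then average" rule, and the parameters `n_farthest` and `n_average` are all hypothetical, and the toy vectors stand in for real x-vectors.

```python
import math
import random


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def pseudo_speaker(source, pool, n_farthest, n_average, rng):
    """Build a pseudo-speaker vector from pool vectors far from the source.

    Assumed strategy (hypothetical): rank the pool by distance to the
    source, keep the n_farthest most distant vectors, draw n_average of
    them at random, and return their element-wise mean.
    """
    if not 0 < n_average <= n_farthest <= len(pool):
        raise ValueError("need 0 < n_average <= n_farthest <= len(pool)")
    # Most distant candidates come first.
    ranked = sorted(pool, key=lambda v: cosine_distance(source, v), reverse=True)
    chosen = rng.sample(ranked[:n_farthest], n_average)
    dim = len(source)
    return [sum(v[i] for v in chosen) / n_average for i in range(dim)]


# Toy data: 50 random 8-dimensional vectors standing in for a pool of x-vectors.
rng = random.Random(0)
dim = 8
pool = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(50)]
source = [rng.gauss(0.0, 1.0) for _ in range(dim)]
target = pseudo_speaker(source, pool, n_farthest=20, n_average=10, rng=rng)
```

In a pipeline of the kind the abstract describes, a vector such as `target` would replace the source speaker's x-vector as input to the acoustic and waveform models, while the linguistic features are left unchanged. Averaging several distant candidates, rather than picking a single one, is one way to avoid mapping the source onto an existing real speaker.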

Keywords

VoicePrivacy 2020 Challenge; Speaker anonymization; Speech recognition

Domains

Signal and Image Processing [eess.SP]

Main file

MultiSpeech.pdf (256.39 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-02995855

Submitted on: Monday, 9 November 2020, 13:20:52

Last modified on: Monday, 11 September 2023, 17:41:19

Long-term archiving on: Wednesday, 10 February 2021, 18:58:07

Dates and versions

hal-02995855, version 1 (09-11-2020)

Identifiers

HAL Id: hal-02995855, version 1

Cite

Pierre Champion, Denis Jouvet, Anthony Larcher. Speaker information modification in the VoicePrivacy 2020 toolchain. [Research Report] INRIA Nancy, équipe Multispeech; LIUM - Laboratoire d'Informatique de l'Université du Mans. 2020. ⟨hal-02995855⟩


Collections

CNRS INRIA UNIV-LEMANS GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD LARA LIUM SILECS

