Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, EpiSciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Skip to Main content Skip to Navigation
Conference papers

Design Choices for X-vector Based Speaker Anonymization

Brij Mohan Lal Srivastava 1 Natalia Tomashenko 2 Xin Wang 3 Emmanuel Vincent 4 Junichi yamagishi 3 Mohamed Maouche 5 Aurélien Bellet 1 Marc Tommasi 6 
1 MAGNET - Machine Learning in Information Networks
Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189
4 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
5 DRIM - Distribution, Recherche d'Information et Mobilité
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : The recently proposed x-vector based anonymization scheme converts any input voice into that of a random pseudo-speaker. In this paper, we present a flexible pseudo-speaker selection technique as a baseline for the first VoicePrivacy Challenge. We explore several design choices for the distance metric between speakers, the region of x-vector space where the pseudo-speaker is picked, and gender selection. To assess the strength of anonymization achieved, we consider attackers using an x-vector based speaker verification system who may use original or anonymized speech for enrollment, depending on their knowledge of the anonymization scheme. The Equal Error Rate (EER) achieved by the attackers and the decoding Word Error Rate (WER) over anonymized data are reported as the measures of privacy and utility. Experiments are performed using datasets derived from LibriSpeech to find the optimal combination of design choices in terms of privacy and utility.
Complete list of metadata

Cited literature [27 references]  Display  Hide  Download
Contributor : Brij Mohan Lal Srivastava Connect in order to contact the contributor
Submitted on : Saturday, July 25, 2020 - 6:15:55 PM
Last modification on : Tuesday, July 5, 2022 - 8:38:39 AM


Files produced by the author(s)


  • HAL Id : hal-02610447, version 2
  • ARXIV : 2005.08601


Brij Mohan Lal Srivastava, Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Junichi yamagishi, et al.. Design Choices for X-vector Based Speaker Anonymization. INTERSPEECH 2020, International Speech Communication Association (ISCA), Oct 2020, Shanghai, China. ⟨hal-02610447v2⟩



Record views


Files downloads