Unsupervised regularization of the embedding extractor for robust language identification

Raphaël Duroselle; Denis Jouvet; Irina Illina

Communication Dans Un Congrès Année : 2020

Unsupervised regularization of the embedding extractor for robust language identification

(1) , (1) , (1)

Raphaël Duroselle

Fonction : Auteur

Speech Modeling for Facilitating Oral-Based Communication

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Speech Modeling for Facilitating Oral-Based Communication

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Speech Modeling for Facilitating Oral-Based Communication

Résumé

State-of-the-art spoken language identification systems are constituted of three modules: a frame-level feature extractor, a segment-level embedding extractor and a final classifier. The performance of these systems degrades when facing mismatch between training and testing data. Most domain adaptation methods focus on adaptation of the final classifier. In this article , we propose a model-based unsupervised domain adaptation of the segment-level embedding extractor. The approach consists in a modification of the loss function used for training the embedding extractor. We introduce a regularization term based on the maximum mean discrepancy loss. Experiments were performed on the RATS corpus with transmission channel mismatch between telephone and radio channels. We obtained the same language identification performance as supervised training on the target domains but without using labeled data from these domains.

Domaines

Réseau de neurones [cs.NE]

Fichier principal

odyssey_corrections_publication.pdf (929.36 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Raphaël Duroselle : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02544156

Soumis le : jeudi 16 avril 2020-09:27:36

Dernière modification le : lundi 11 septembre 2023-17:41:19

Dates et versions

hal-02544156 , version 1 (16-04-2020)

Identifiants

HAL Id : hal-02544156 , version 1

Citer

Raphaël Duroselle, Denis Jouvet, Irina Illina. Unsupervised regularization of the embedding extractor for robust language identification. Odyssey 2020 - The Speaker and Language Recognition Workshop, Nov 2020, Tokyo, Japan. ⟨hal-02544156⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD SILECS

141 Consultations

228 Téléchargements

Unsupervised regularization of the embedding extractor for robust language identification

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager