Topological representation of speech for speaker recognition

Gabriel H. Sierra; Jean-François Bonastre; Driss Matrouf; José Ramon Calvo

Communication Dans Un Congrès Année : 2010

Topological representation of speech for speaker recognition

(1) , (1) , (1) ,

Gabriel H. Sierra

Fonction : Auteur

Laboratoire Informatique d'Avignon

Jean-François Bonastre

Fonction : Auteur
PersonId : 172421
IdHAL : jean-francois-bonastre
ORCID : 0000-0001-7741-3346
IdRef : 079112978

Laboratoire Informatique d'Avignon

Driss Matrouf

Fonction : Auteur
PersonId : 176307
IdHAL : driss-matrouf
IdRef : 137773439

Laboratoire Informatique d'Avignon

José Ramon Calvo

Fonction : Auteur

Résumé

During last decade, researchers in speaker recognition have been working over the same acoustic space, regardless of whether the data lie on a linear space or not. Our proposal is to take into account the inner geometric structure of the speech in order to obtain a new space with a better representation of the speech data. A topological approach based on manifolds obtained thanks to Laplacian and Isomap algorithms is proposed. In this first work, the proposal is evaluated in terms of dimension reduction of the supervector space, known to have a high redundancy. The experiments are done in the NIST-SRE framework. It appears that the proposed approach allows to reduce by a factor four the dimension of the supervector space without losses in terms of EER. This first result highlights the potential of topological approaches for speaker recognition.

Mots clés

Index Terms: speaker recognition topological information di-mension reduction

Domaines

Informatique [cs]

bibliothèque Universitaire Déposants HAL-Avignon : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01320360

Soumis le : lundi 23 mai 2016-17:13:23

Dernière modification le : mardi 14 janvier 2020-10:38:06

Dates et versions

hal-01320360 , version 1 (23-05-2016)

Identifiants

HAL Id : hal-01320360 , version 1

Citer

Gabriel H. Sierra, Jean-François Bonastre, Driss Matrouf, José Ramon Calvo. Topological representation of speech for speaker recognition. INTERSPEECH, Sep 2010, Makuhari, Japan. ⟨hal-01320360⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON LIA

29 Consultations

0 Téléchargements

Topological representation of speech for speaker recognition

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager