Topological representation of speech for speaker recognition

Abstract : During the last decade, researchers in speaker recognition have worked in the same acoustic space, regardless of whether the data lie in a linear space. Our proposal is to take into account the intrinsic geometric structure of speech in order to obtain a new space that better represents the speech data. We propose a topological approach based on manifolds obtained with the Laplacian and Isomap algorithms. In this first work, the proposal is evaluated in terms of dimension reduction of the supervector space, which is known to be highly redundant. The experiments are carried out in the NIST-SRE framework. The proposed approach reduces the dimension of the supervector space by a factor of four without loss in terms of equal error rate (EER). This first result highlights the potential of topological approaches for speaker recognition.
Document type :
Conference papers
https://hal.archives-ouvertes.fr/hal-01320360
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon
Submitted on : Monday, May 23, 2016 - 5:13:23 PM
Last modification on : Tuesday, July 2, 2019 - 5:38:02 PM

Identifiers

  • HAL Id : hal-01320360, version 1

Citation

Gabriel H. Sierra, Jean-François Bonastre, Driss Matrouf, José Ramon Calvo. Topological representation of speech for speaker recognition. INTERSPEECH, Sep 2010, Makuhari, Japan. ⟨hal-01320360⟩
