Skip to Main content Skip to Navigation

The IRIT-UPS system @ ZeroSpeech 2017

Abstract : In this report, we describe the IRIT-UPS approach for the unsupervised discovery of sublexical units in speech in the framework of the Zero Resource Speech Challenge 2017 edition. We derive unsupervised representations that consist of the distances between the MFCC vectors extracted from the speech signal and a hundred cluster centroids estimated on millions of these vectors with an efficient scalable implementation of the k-means algorithm. We show that using a whitening transformation (ZCA) to pre-process the MFCCs is crucial to outperform the baseline MFCC representation. Furthermore, we obtain slight but consistent improvements by performing data selection before estimating the centroids. This simple approach yielded competitive results for the within-speaker condition but generalized less well in the betweenspeaker one.
Document type :
Complete list of metadata
Contributor : Françoise Grélaud Connect in order to contact the contributor
Submitted on : Thursday, February 4, 2021 - 4:53:13 PM
Last modification on : Monday, July 4, 2022 - 9:29:43 AM


  • HAL Id : hal-03131886, version 1


Thomas Pellegrini, Céline Manenti, Julien Pinquier. The IRIT-UPS system @ ZeroSpeech 2017. [Research Report] IRIT. 2017. ⟨hal-03131886⟩



Record views