I-vector based Representation of Highly Imperfect Automatic Transcriptions

Abstract : The performance of Automatic Speech Recognition (ASR) systems drops dramatically when used in noisy environments. Speech analytics suffer from this poor quality of automatic transcriptions. In this paper, we seek to identify themes from dialogues of telephone conversation services using multiple topic-spaces estimated with a Latent Dirichlet Allocation (LDA) approach. This technique consists in estimating several topic models that offer different views of the document. Unfortunately, such a multi-model approach also introduces additional vari-abilities due to the model diversity. We propose to extract the useful information from the full model-set by using an i-vector based approach, previously developed in the context of speaker recognition. Experiments are conducted on the DECODA corpus , that contains records from the call center of the Paris Transportation Company. Results show the effectiveness of the proposed representation paradigm, our identification system reaching an accuracy of 84.7%, with a gain of 3.3 points compared to the baseline.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01318657
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon <>
Submitted on : Thursday, May 19, 2016 - 5:27:02 PM
Last modification on : Tuesday, July 2, 2019 - 5:38:02 PM

Identifiers

  • HAL Id : hal-01318657, version 1

Collections

Citation

Mohamed Morchid, Mohamed Bouallegue, Richard Dufour, Georges Linarès, Driss Matrouf, et al.. I-vector based Representation of Highly Imperfect Automatic Transcriptions. INTERSPEECH, May 2014, Singapore, Singapore. ⟨hal-01318657⟩

Share

Metrics

Record views

67