Spoken Language Understanding in a Latent Topic-based Subspace

Mohamed Morchid; Mohamed Bouaziz; Waad Ben Kheder; Killian Janod; Pierre-Michel Bousquet Bousquet; Richard Dufour; Georges Linares

doi:10.21437/Interspeech.2016-50

Communication Dans Un Congrès Année : 2016

Spoken Language Understanding in a Latent Topic-based Subspace

(1) , (2) , (1) , (1) , (1) , (1) , (1)

1
2

Mohamed Morchid

Fonction : Auteur
PersonId : 21451
IdHAL : morchid
ORCID : 0000-0002-4427-2468
IdRef : 188328343

Laboratoire Informatique d'Avignon

Mohamed Bouaziz

Fonction : Auteur

Département de Recherche en Ingéniérie des Véhicules pour l'Environnement

Waad Ben Kheder

Fonction : Auteur

Laboratoire Informatique d'Avignon

Killian Janod

Fonction : Auteur

Laboratoire Informatique d'Avignon

Pierre-Michel Bousquet Bousquet

Fonction : Auteur

Laboratoire Informatique d'Avignon

Richard Dufour

Fonction : Auteur
PersonId : 178348
IdHAL : richard-dufour
ORCID : 0000-0003-1203-9108

Laboratoire Informatique d'Avignon

Georges Linares

Fonction : Auteur
PersonId : 4977
IdHAL : georges-linares
IdRef : 079368794

Laboratoire Informatique d'Avignon

Résumé

Performance of spoken language understanding applications declines when spoken documents are automatically transcribed in noisy conditions due to high Word Error Rates (WER). To improve the robustness to transcription errors, recent solutions propose to map these automatic transcriptions in a latent space. These studies have proposed to compare classical topic-based representations such as Latent Dirichlet Allocation (LDA), supervised LDA and author-topic (AT) models. An original compact representation, called c-vector, has recently been introduced to walk around the tricky choice of the number of latent topics in these topic-based representations. Moreover, c-vectors allow to increase the robustness of document classification with respect to transcription errors by compacting different LDA representations of a same speech document in a reduced space and then compensate most of the noise of the document representation. The main drawback of this method is the number of sub-tasks needed to build the c-vector space. This paper proposes to both improve this compact representation (c-vector) of spoken documents and to reduce the number of needed sub-tasks, using an original framework in a robust low dimensional space of features from a set of AT models called "Latent Topic-based Sub-space" (LTS). In comparison to LDA, the AT model considers not only the dialogue content (words), but also the class related to the document. Experiments are conducted on the DECODA corpus containing speech conversations from the call-center of the RATP Paris transportation company. Results show that the original LTS representation outperforms the best previous compact representation (c-vector), with a substantial gain of more than 2.5% in terms of correctly labeled conversations.

Mots clés

factor analysis author-topic model c-vector document clustering

Domaines

Informatique et langage [cs.CL]

Fichier principal

666cdeab5a16ebfa8902ad6b240992fd34a9.pdf (1 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Richard Dufour : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02356390

Soumis le : jeudi 14 novembre 2019-13:17:46

Dernière modification le : jeudi 7 septembre 2023-16:08:10

Archivage à long terme le : samedi 15 février 2020-12:50:04

Dates et versions

hal-02356390 , version 1 (14-11-2019)

Identifiants

HAL Id : hal-02356390 , version 1
DOI : 10.21437/Interspeech.2016-50

Citer

Mohamed Morchid, Mohamed Bouaziz, Waad Ben Kheder, Killian Janod, Pierre-Michel Bousquet Bousquet, et al.. Spoken Language Understanding in a Latent Topic-based Subspace. Interspeech 2016, Sep 2016, San Francisco, United States. ⟨10.21437/Interspeech.2016-50⟩. ⟨hal-02356390⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON UNIV-BOURGOGNE DRIVE LIA

50 Consultations

67 Téléchargements

Spoken Language Understanding in a Latent Topic-based Subspace

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager