Subspace Gaussian Mixture Models for Dialogues Classification

Mohamed Bouallegue; Mohamed Morchid; Richard Dufour; Driss Matrouf; Georges Linarès; Renato de Mori

Communication Dans Un Congrès Année : 2014

Subspace Gaussian Mixture Models for Dialogues Classification

(1) , (1) , (1) , (1) , (1) , (2, 1)

1
2

Mohamed Bouallegue

Fonction : Auteur
PersonId : 772200
IdRef : 177675128

Laboratoire Informatique d'Avignon

Mohamed Morchid

Fonction : Auteur
PersonId : 21451
IdHAL : morchid
ORCID : 0000-0002-4427-2468
IdRef : 188328343

Laboratoire Informatique d'Avignon

Richard Dufour

Fonction : Auteur
PersonId : 178348
IdHAL : richard-dufour
ORCID : 0000-0003-1203-9108

Laboratoire Informatique d'Avignon

Driss Matrouf

Fonction : Auteur
PersonId : 176307
IdHAL : driss-matrouf
IdRef : 137773439

Laboratoire Informatique d'Avignon

Georges Linarès

Fonction : Auteur
PersonId : 4977
IdHAL : georges-linares
IdRef : 079368794

Laboratoire Informatique d'Avignon

Renato de Mori

Fonction : Auteur
PersonId : 981954

McGill University = Université McGill [Montréal, Canada]

Laboratoire Informatique d'Avignon

Résumé

The main objective of this paper is to identify themes from dialogues of telephone conversations in a real-life customer care service. In order to capture significant semantic content in spite of high expression variability, features are extracted in a large number of hidden spaces constructed with a Latent Dirichlet Allocation (LDA) approach. Multiple views of a spoke document can then be represented with several hidden topic models. Nonetheless, the model diversity due to the multi-model approach introduces a new type of variability. An approach is proposed based on features extracted in a common homogenous subspace with the purpose of reducing the multi-span representation variability. A Gaussian Mixture Model subspace model, inspired by previous work on speaker identification, is proposed for theme identification. This representation, novel for theme classification, is compared with the direct application of multiple topic-model representations. Experiments are reported using a corpus collected in the call center of the Paris Transportation Service. Results show the effectiveness of the proposed representation paradigm with a theme identification accuracy of 78.8%, showing a significant improvement with respect to previous results on the same corpus.

Mots clés

Index Terms: Human/Human conversation analysis theme identification LDA features GMM subspace Latent Dirichlet Allocation

Domaines

Informatique [cs]

bibliothèque Universitaire Déposants HAL-Avignon : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01313132

Soumis le : lundi 9 mai 2016-15:49:55

Dernière modification le : mardi 22 mars 2022-14:40:01

Dates et versions

hal-01313132 , version 1 (09-05-2016)

Identifiants

HAL Id : hal-01313132 , version 1

Citer

Mohamed Bouallegue, Mohamed Morchid, Richard Dufour, Driss Matrouf, Georges Linarès, et al.. Subspace Gaussian Mixture Models for Dialogues Classification. Interspeech, May 2014, Singapore, Singapore. ⟨hal-01313132⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON LIA

46 Consultations

0 Téléchargements

Subspace Gaussian Mixture Models for Dialogues Classification

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager