Improving dialogue classification using a topic space representation and a Gaussian classifier based on the decision rule

Abstract : In this paper, we study the impact of dialogue representations and classification methods in the task of theme identification of telephone conversation services having highly imperfect automatic transcriptions. Two dialogue representations are firstly compared: the classical Term Frequency-Inverse Document Frequency with Gini purity criteria (TF-IDF-Gini) method and the Latent Dirichlet Allocation (LDA) approach. We then propose to study an original classification method that takes advantage of the LDA topic space representation , highlighted as the best dialogue representation. To do so, two assumptions about topic representation led us to choose a Gaussian process (GP) based method. This approach is compared with a Support Vector Machine (SVM) classification method. Results show that the GP approach is a better solution to deal with the multiple theme complexity of a dialogue, no matter the conditions studied (manual or automatic transcriptions). We finally discuss the impact of the topic space reduction on the classification accuracy.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01318674
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon <>
Submitted on : Thursday, May 19, 2016 - 5:47:41 PM
Last modification on : Saturday, March 23, 2019 - 1:22:42 AM

Identifiers

Collections

Citation

Mohamed Morchid, Richard Dufour, Pierre-Michel Bousquet, Mohamed Bouallegue, Georges Linarès, et al.. Improving dialogue classification using a topic space representation and a Gaussian classifier based on the decision rule. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2014, Florence, Italy. ⟨10.1109/ICASSP.2014.6853571⟩. ⟨hal-01318674⟩

Share

Metrics

Record views

60