A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding

Abstract : The extraction of flat concepts out of a given word sequence is usually one of the first steps in building a spoken language understanding (SLU) or dialogue system. This paper explores five different modelling approaches for this task and presents results on a French state-of-the-art corpus, MEDIA. Additionally, two log-linear modelling approaches could be further improved by adding morphologic knowledge. This paper goes beyond what has been reported in the literature, e.g. in (Raymond & Riccardi 07). We applied the models on the same training and testing data and used the NIST scoring toolkit to evaluate the experimental results to ensure identical conditions for each of the experiments and the comparability of the results. Using a model based on conditional random fields, we achieve a concept error rate of 11.8% on the MEDIA evaluation corpus.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01321122
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon <>
Submitted on : Wednesday, May 25, 2016 - 8:57:58 AM
Last modification on : Saturday, March 23, 2019 - 1:22:32 AM

Identifiers

  • HAL Id : hal-01321122, version 1

Collections

Citation

Stefan Hahn, Patrick Lehnen, Christian Raymond, Hermann Ney. A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding. LREC, May 2008, Marrakech, Morocco. ⟨hal-01321122⟩

Share

Metrics

Record views

52