Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA

Abstract : Within the framework of the French evaluation program MEDIA on spoken dialogue systems, this paper presents the methods proposed at the LIA for the robust extraction of basic conceptual constituents (or concepts) from an audio message. The conceptual decoding model proposed follows a stochastic paradigm and is directly integrated into the Automatic Speech Recognition (ASR) process. This approach allows us to keep the probabilistic search space on sequences of words produced by the ASR module and to project it to a probabilistic search space of sequences of concepts. This paper presents the first ASR results on the French spoken dialogue corpus MEDIA, available through ELDA. The experiments made on this corpus show that the performance reached by our approach is better than the traditional sequential approach that looks first for the best sequence of words before looking for the best sequence of concepts.
Document type :
Conference papers
Complete list of metadatas

Cited literature [8 references]  Display  Hide  Download
Contributor : Christophe Servan <>
Submitted on : Thursday, June 4, 2015 - 4:15:29 PM
Last modification on : Saturday, March 23, 2019 - 1:22:45 AM
Long-term archiving on : Tuesday, September 15, 2015 - 11:01:52 AM


Publisher files allowed on an open archive


  • HAL Id : hal-01160181, version 1



Christophe Servan, Christian Raymond, Frédéric Béchet, Pascal Nocera. Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA. The Ninth International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), Sep 2006, Pittsburgh, United States. ⟨hal-01160181⟩



Record views


Files downloads