Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA

Abstract : Within the framework of the French evaluation program MEDIA on spoken dialogue systems, this paper presents the methods proposed at the LIA for the robust extraction of basic conceptual constituents (or concepts) from an audio message. The conceptual decoding model proposed follows a stochastic paradigm and is directly integrated into the Automatic Speech Recognition (ASR) process. This approach allows us to keep the probabilistic search space on sequences of words produced by the ASR module and to project it to a probabilistic search space of sequences of concepts. This paper presents the first ASR results on the French spoken dialogue corpus MEDIA, available through ELDA. The experiments made on this corpus show that the performance reached by our approach is better than the traditional sequential approach that looks first for the best sequence of words before looking for the best sequence of concepts.
Document type :
Conference papers
Complete list of metadatas

Cited literature [8 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01160181
Contributor : Christophe Servan <>
Submitted on : Thursday, June 4, 2015 - 4:15:29 PM
Last modification on : Saturday, March 23, 2019 - 1:22:45 AM
Long-term archiving on : Tuesday, September 15, 2015 - 11:01:52 AM

File

FB_2006_INTERSPEECH_1.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01160181, version 1

Collections

Citation

Christophe Servan, Christian Raymond, Frédéric Béchet, Pascal Nocera. Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA. The Ninth International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), Sep 2006, Pittsburgh, United States. ⟨hal-01160181⟩

Share

Metrics

Record views

144

Files downloads

226