Journal articles

Sequential Decision Strategies for Machine Interpretation of Speech

Abstract : Recognition errors made by automatic speech recognition (ASR) systems need not prevent the development of useful dialogue applications if the interpretation strategy has an introspection capability for evaluating the reliability of its results. This paper proposes an interpretation strategy that is particularly effective when applications are developed with a training corpus of moderate size. From the lattice of word hypotheses generated by an ASR system, a short list of conceptual structures is obtained with a set of finite state machines (FSM). An interpretation or rejection decision is then made by a tree-based strategy. The nodes of the tree correspond to elaboration-decision units, each containing a redundant set of classifiers. One decision-tree-based classifier and two large-margin classifiers are trained on a development set to serve as interpretation knowledge sources. Discriminative training of the classifiers selects the linguistic and confidence-based features that contribute to a cooperative assessment of the reliability of an interpretation. This assessment leads to the definition of a limited number of reliability states. The probability that a proposed interpretation is correct is given by its reliability state and transmitted to the dialogue manager. Experimental results are presented for a telephone service application.
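The final step of the abstract, mapping a reliability state to a probability of correctness, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how reliability states are defined, so the sketch assumes, purely for illustration, that the state is the number of classifiers whose output agrees with the proposed interpretation, and that the probability for each state is its relative frequency of correct interpretations on a development set. The names `reliability_state`, `estimate_state_probabilities`, `decide`, and the 0.5 threshold are all hypothetical.

```python
from collections import defaultdict


def reliability_state(votes, hypothesis):
    """Assumed state definition: how many classifiers agree with the hypothesis."""
    return sum(1 for vote in votes if vote == hypothesis)


def estimate_state_probabilities(dev_samples):
    """Estimate P(correct | state) as a relative frequency on a development set.

    dev_samples: iterable of (votes, hypothesis, is_correct) tuples, where
    votes is the list of labels output by the redundant classifiers.
    """
    counts = defaultdict(lambda: [0, 0])  # state -> [n_correct, n_total]
    for votes, hypothesis, is_correct in dev_samples:
        state = reliability_state(votes, hypothesis)
        counts[state][0] += int(is_correct)
        counts[state][1] += 1
    return {state: correct / total for state, (correct, total) in counts.items()}


def decide(votes, hypothesis, state_probs, threshold=0.5):
    """Accept or reject an interpretation; the probability is what would be
    passed on to the dialogue manager. Unseen states default to 0.0 (assumption)."""
    probability = state_probs.get(reliability_state(votes, hypothesis), 0.0)
    decision = "accept" if probability >= threshold else "reject"
    return decision, probability
```

Keeping the number of states small means each probability is estimated from many development examples, which is consistent with the abstract's emphasis on training corpora of moderate size.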

https://hal.archives-ouvertes.fr/hal-01314620
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon
Submitted on : Wednesday, May 11, 2016 - 4:28:02 PM
Last modification on : Tuesday, January 14, 2020 - 10:38:06 AM

Citation

Christian Raymond, Frédéric Béchet, Nathalie Camelin, Renato de Mori, Géraldine Damnati. Sequential Decision Strategies for Machine Interpretation of Speech. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2007, ⟨10.1109/TASL.2006.876862⟩. ⟨hal-01314620⟩
