Sequential Decision Strategies for Machine Interpretation of Speech

Abstract: Recognition errors made by automatic speech recognition (ASR) systems may not prevent the development of useful dialogue applications if the interpretation strategy has an introspection capability for evaluating the reliability of the results. This paper proposes an interpretation strategy that is particularly effective when applications are developed with a training corpus of moderate size. From the lattice of word hypotheses generated by an ASR system, a short list of conceptual structures is obtained with a set of finite state machines (FSMs). An interpretation or a rejection decision is then made by a tree-based strategy. The nodes of the tree correspond to elaboration-decision units containing a redundant set of classifiers. One decision-tree-based classifier and two large-margin classifiers are trained on a development set to become interpretation knowledge sources. Discriminative training of the classifiers selects linguistic and confidence-based features that contribute to a cooperative assessment of the reliability of an interpretation. This assessment leads to the definition of a limited number of reliability states. The probability that a proposed interpretation is correct is given by its reliability state and transmitted to the dialogue manager. Experimental results are presented for a telephone service application.
Document type: Journal article
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2007, 〈10.1109/TASL.2006.876862〉

https://hal.archives-ouvertes.fr/hal-01314620
Contributor: Bibliothèque Universitaire Déposants Hal-Avignon
Submitted on: Wednesday, May 11, 2016 - 16:28:02
Last modified on: Tuesday, December 18, 2018 - 08:04:08

Citation

Christian Raymond, Frédéric Béchet, Nathalie Camelin, Renato De Mori, Géraldine Damnati. Sequential Decision Strategies for Machine Interpretation of Speech. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2007, 〈10.1109/TASL.2006.876862〉. 〈hal-01314620〉
