Automatic Dialog Acts Recognition based on Words Clusters

Pavel Kral 1 Jana Kleckova 1 Christophe Cerisara 2
2 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper deals with automatic dialog acts (DAs) recognition in Czech. A Dialog act is defined by J. L. Austin [1] as a meaning of an utterance at the level of illocutionary force. The four following DAs are considered: statements, orders, yes/no questions and other questions. In our previous works, we proposed, implemented and evaluated two new approaches to automatic DAs recognition based on sentence structure. These methods have been validated on a Czech corpus that simulates a task of train tickets reservation. The main goal of this paper is to propose a new approach to solve the problem of lack of training data for automatic DA recognition. This approach clusters the words in the sentence into several groups using maximization of mutual information between two neighbor word classes. The classification accuracy of the unigram model (our baseline approach) is 91 %. The proposed method, a clustered unigram model, reduces the DA error rate by 12 %
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [12 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00086310
Contributor : Pavel Kral <>
Submitted on : Tuesday, July 18, 2006 - 2:18:04 PM
Last modification on : Friday, February 9, 2018 - 1:20:01 PM
Document(s) archivé(s) le : Tuesday, April 6, 2010 - 12:14:11 AM

Identifiers

  • HAL Id : hal-00086310, version 1

Collections

Citation

Pavel Kral, Jana Kleckova, Christophe Cerisara. Automatic Dialog Acts Recognition based on Words Clusters. 2006, 6 p. ⟨hal-00086310⟩

Share

Metrics

Record views

342

Files downloads

181