Purely Corpus-based Automatic Conversation Authoring

Abstract: This paper presents an automatic corpus-based process to author an open-domain conversational strategy usable both in chatterbot systems and as a fallback strategy for out-of-domain human utterances. Our approach is implemented on a corpus of television drama subtitles. This system is used as a chatterbot system to collect a corpus of 41 open-domain textual dialogues with 27 human participants. The general capabilities of the system are studied through objective measures and subjective self-reports in terms of understandability, repetition and coherence of the system responses selected in reaction to human utterances. Subjective evaluations of the collected dialogues are presented with respect to amusement, engagement and enjoyability. The main factors influencing those dimensions in our chatterbot experiment are discussed.
Document type:
Conference paper
Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk and Stelios Piperidis. 10th edition of the Language Resources and Evaluation Conference (LREC), May 2016, Portorož, Slovenia. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), ISBN: 978-2-9517408-9-1, 2016. 〈http://lrec2016.lrec-conf.org/en/〉

https://hal.archives-ouvertes.fr/hal-01309202
Contributor: Guillaume Dubuisson Duplessis
Submitted on: Friday, April 29, 2016 - 09:52:24
Last modified on: Thursday, July 20, 2017 - 09:26:00
Document(s) archived on: Tuesday, November 15, 2016 - 17:43:36

File

lrec2016.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01309202, version 1

Citation

Guillaume Dubuisson Duplessis, Vincent Letard, Anne-Laure Ligozat, Sophie Rosset. Purely Corpus-based Automatic Conversation Authoring. Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk and Stelios Piperidis. 10th edition of the Language Resources and Evaluation Conference (LREC), May 2016, Portorož, Slovenia. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), ISBN: 978-2-9517408-9-1, 2016. 〈http://lrec2016.lrec-conf.org/en/〉. 〈hal-01309202〉
